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DESCRIPTION 

MULT I -CHANNEL ECHO CANCEL METHOD, MULT I - CHANNEL SOUND 
TRANSFER METHOD, STEREO ECHO CANCELLER, STEREO SOUND TRANSFER 
5 APPARATUS AND TRANSFER FUNCTION CALCULATION APPARATUS 

Related Applications 

This application is a continuation of PCT 
application No. PCT/ JP02/06968 , filed July 10, 2002, which is 
10 based upon, and claims priority from, Japanese Patent 

Application No. 2001-211279, filed July 11, 2001 and Japanese 
Patent Application No. 2002-179722, filed June 20, 2002. 

Technical Field 
15 [0001] 

This invention relates to an echo cancellation 
technique for multi -channel audio signals and a transfer 
function calculation technique, and solves a problem of 
indefinite coefficient by a new technique. 

20 

Background Art 
[0002] 

In two-way stereo audio transmission that is used in 
teleconferencing systems and so forth, the problem of 
25 indefinite coefficient of echo cancellers has conventionally 
been pointed out and, for solving it, there have been 
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proposed various techniques (see Journal of The Institute of 
Electronics, Information and Communication Engineers, vol. 81, 
No. 3, pp. 266-274, March 1998). As one of the techniques 
for solving the problem of indefinite coefficient, there is a 
5 method of reducing the interchannel correlation. As concrete 
techniques therefor, there have conventionally been proposed 
the addition of random noise, the correlation removal by 
filters, the interchannel frequency shift, the use of an 
interleave comb filter, the nonlinear processing (Laid-Open 
10 Patent Publication No. H10-190848) and so forth. 
[0003] 

According to the foregoing conventional techniques , 
since original stereo signals are subjected to processing and 
then reproduced, there has been a problem that deterioration 

15 more or less occurs in reproduced signals. Further, when the 
processing is complicated, a delay occurs in the reproduced 
signals, so that there has been a problem of difficulty in 
conversation in the teleconference and so forth. Further, 
when the processing is complicated and the processing 

20 capability of a processing circuit is low, there have been 

those instances where it is difficult to update a coefficient 
of an echo canceller in real time while carrying out the echo 
cancel processing . 

This invention provides an echo cancellation 

25 technique for multi-channel audio signals and a transfer 

function calculation technique that have solved the foregoing 
problems in the conventional techniques. 
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Disclosure of the Invention 
[0004] 

[Inventions of Claims 1 to 8 and Inventions relating 
5 to such Inventions] 

A multi- channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 
with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 

10 in which multi-channel sounds inputted from an outside and 
reproduced by said respective loudspeakers and having a 
correlation with each other are collected by said microphones, 
individual transfer functions of said plurality of audio 
transfer systems or a plurality of composite transfer 

15 functions obtained by suitably combining said individual 

transfer functions are estimated so as to set corresponding 
filter characteristics, respectively, echo cancel signals are 
respectively produced by giving said set filter 
characteristics to corresponding individual signals to be 

2 0 reproduced by said respective loudspeakers or a plurality of 
composite signals obtained by suitably combining said 
individual signals, and said echo cancel signals are 
subtracted from corresponding individual collected audio 
signals of said one or plurality of microphones, or a 

25 plurality of composite signals obtained by suitably combining 
said individual collected audio signals, thereby performing 
echo cancellation, and wherein, using as reference signals 
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(representing those signals that are referred to for 
estimating the transfer functions or the composite transfer 
functions) a set of a plurality of low-correlation composite 
signals which correspond to signals obtained by suitably 
5 combining said multi-channel audio signals and which have a 
lower correlation with each other than that between said 
multi-channel audio signals (e.g. suitably combining said 
multi-channel audio signals to produce a plurality of low- 
correlation composite signals having a lower correlation with 

10 each other than that between said mult i -channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 
a set of a plurality of low-correlation composite signals 
which correspond to signals obtained by suitably combining 

15 said multi- channel audio signals and which have a lower 

correlation with each other than that between said multi- 
channel audio signals and using the set of said plurality of 
low-correlation composite signals as reference signals, or 
the like) , individual transfer functions of the respective 

20 audio transfer systems or a plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions are respectively derived, thereby to set 
corresponding filter characteristics. According to this 
invention, using as reference signals a set of a plurality of 

25 low-correlation composite signals which correspond to signals 
obtained by suitably combining multi -channel audio signals 
having a correlation therebetween and which have a lower 
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correlation with each other than that between such multi- 
channel audio signals, individual transfer functions of the 
respective audio transfer systems or a plurality of composite 
transfer functions obtained by suitably combining such 
5 individual transfer functions are respectively derived, and 
corresponding filter characteristics are set, thereby to 
enable echo cancellation. In accordance therewith, since the 
multi- channel audio signals can be reproduced from the 
loudspeakers with no or less processing, which induces 

10 deterioration, applied to the multi-channel audio signals, 

excellent reproduced tone quality can be achieved. Further, 
there is no or only a small delay in reproduced signals. 
Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. The calculation 

15 of respectively deriving the individual transfer functions of 
the respective audio transfer systems or the plurality of 
composite transfer functions obtained by suitably combining 
said individual transfer functions, using as the reference 
signals the set of the plurality of low-correlation composite 

2 0 signals, may be, for example, a calculation of respectively 
deriving the individual transfer functions of the respective 
audio transfer systems or the plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions, based on a cross -spectrum calculation 

25 between the plurality of low-correlation composite signals 

and the individual collected audio signals of the microphones, 
or the plurality of composite signals obtained by suitably 
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combining said individual collected audio signals. Further, 
the calculation of respectively deriving the individual 
transfer functions of said plurality of audio transfer 
systems or the plurality of composite transfer functions 
5 obtained by suitably combining said individual transfer 

functions, based on said cross -spectrum calculation, may be, 
for example, a calculation of respectively deriving the 
individual transfer functions of said plurality of audio 
transfer systems or the plurality of composite transfer 

10 functions obtained by suitably combining said individual 
transfer functions, by combining said multi- channel audio 
signals through addition or subtraction to produce a 
plurality of low-correlation composite signals having a lower 
correlation with each other than that between said multi - 

15 channel audio signals, deriving cross spectra between said 
plurality of low-correlation composite signals and the 
individual collected audio signals of the microphones, or the 
plurality of composite signals obtained by suitably combining 
said individual collected audio signals, and ensemble- 

20 averaging them in a predetermined time period per cross 
spectrum. 

[0005] 

A multi-channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 
25 with a plurality of loudspeakers and one or a plurality of 

microphones and forming a plurality of audio transfer systems 
in which multi-channel sounds inputted from an outside and 
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reproduced by said respective loudspeakers and having a 
correlation with each other are collected by said microphones, 
individual transfer functions of said plurality of audio 
transfer systems or a plurality of composite transfer 
5 functions obtained by suitably combining said individual 

transfer functions are estimated so as to set corresponding 
filter characteristics, respectively, echo cancel signals are 
respectively produced by giving said set filter 
characteristics to corresponding individual signals to be 

10 reproduced by said respective loudspeakers or a plurality of 
composite signals obtained by suitably combining said 
individual signals, and said echo cancel signals are 
subtracted from corresponding individual collected audio 
signals of said one or plurality of microphones, or a 

15 plurality of composite signals obtained by suitably combining 
said individual collected audio signals, thereby performing 
echo cancellation, wherein, using as reference signals a set 
of a plurality of low-correlation composite signals which 
correspond to signals obtained by suitably combining said 

20 multi -channel audio signals and which have a lower 

correlation with each other than that between said multi- 
channel audio signals (e.g. suitably combining said multi- 
channel audio signals to produce a plurality of low- 
correlation composite signals having a lower correlation with 

25 each other than that between said multi -channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 
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a set of a plurality of low-correlation composite signals 
which correspond to signals obtained by suitably combining 
said multi -channel audio signals and which have a lower 
correlation with each other than that between said multi- 
5 channel audio signals and using the set of said plurality of 
low-correlation composite signals as reference signals, or 
the like) , estimated errors of individual transfer functions 
of the respective audio transfer systems or a plurality of 
composite transfer functions obtained by suitably combining 

10 said individual transfer functions are respectively derived, 
thereby to update corresponding filter characteristics to 
values that cancel said estimated errors. According to this 
invention, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 

15 obtained by suitably combining multi-channel audio signals 
having a correlation therebetween and which have a lower 
correlation with each other than that between such multi- 
channel audio signals, estimated errors of individual 
transfer functions of the respective audio transfer systems 

20 or a plurality of composite transfer functions obtained by 
suitably combining such individual transfer functions are 
respectively derived so as to successively update the 
corresponding filter characteristics to values that cancel 
such estimated errors, thereby to enable echo cancellation. 

25 In accordance therewith, since the multi-channel audio 

signals can be reproduced from the loudspeakers with no or 
less processing, which induces deterioration, applied to the 
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multi -channel audio signals, excellent reproduced tone 
quality can be achieved. Further, there is no or only a 
small delay in reproduced signals. Thus, when applying to 
the teleconferencing system or the like, natural conversation 
5 can be conducted. Further, it is possible to update the 
filter characteristics in real time. The calculation of 
respectively deriving the estimated errors of the individual 
transfer functions of the respective audio transfer systems 
or the plurality of composite transfer functions obtained by 

10 suitably combining said individual transfer functions, using 
as the reference signals the set of the plurality of low- 
correlation composite signals, may be, for example, a 
calculation of respectively deriving the estimated errors of 
the individual transfer functions of the respective audio 

15 transfer systems or the plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions, based on a cross-spectrum calculation 
between said plurality of low-correlation composite signals 
and echo cancel error signals obtained by subtracting the 

2 0 echo cancel signals from the corresponding individual 
collected audio signals of said one or plurality of 
microphones, or the plurality of composite signals obtained 
by suitably combining said individual collected audio signals. 
Further, the calculation of respectively deriving the 

25 estimated errors of the individual transfer functions of said 
plurality of audio transfer systems or the plurality of 
composite transfer functions obtained by suitably combining 
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said individual transfer functions, based on said cross- 
spectrum calculation, may be, for example, a calculation of 
respectively deriving the estimated errors of the individual 
transfer functions of said plurality of audio transfer 
5 systems or the plurality of composite transfer functions 
obtained by suitably combining said individual transfer 
functions, by combining said multi-channel audio signals 
through addition or subtraction to produce a plurality of 
low-correlation composite signals having a lower correlation 

10 with each other than that between said multi-channel audio 
signals, deriving cross spectra between said plurality of 
low-correlation composite signals and the echo cancel error 
signals obtained by subtracting the echo cancel signals from 
the corresponding individual collected audio signals of said 

15 one or plurality of microphones, or the plurality of 
composite signals obtained by suitably combining said 
individual collected audio signals, and ensemble -averaging 
them in a predetermined time period per cross spectrum. 
Further, the correlation between said plurality of low- 

20 correlation composite signals is detected and, when a value 
of said correlation is no less than a prescribed value, 
updating of said filter characteristics is stopped, thereby 
to prevent the echo cancel error signals from unexpectedly 
increasing . 

25 [0006] 

A multi-channel sound transfer method of this 
invention is such that, with respect to two spaces each 
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forming said plurality of audio transfer systems, any of the 
foregoing multi-channel echo cancel methods is carried out, 
so that the mult i -channel audio signals, which have been 
echo-canceled by performing said method, are transmitted 
5 between said two spaces. In accordance therewith, the multi- 
channel audio transmission with reduced echo cancellation can 
be performed between two spots, which, for example, can be 
applied to the teleconferencing system or the like. 
[0007] 

10 A stereo echo cancel method of this invention is a 

method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 

15 said microphones, individual transfer functions of said two 
or four audio transfer systems or a plurality of composite 
transfer functions obtained by suitably combining said 
individual transfer functions are estimated so as to set 
corresponding filter characteristics, respectively, echo 

20 cancel signals are respectively produced by giving said set 

filter characteristics to corresponding individual signals to 
be reproduced by said respective loudspeakers or a plurality 
of composite signals obtained by suitably combining said 
individual signals, and said echo cancel signals are 

2 5 subtracted from corresponding individual collected audio 
signals of said one or two microphones, or a plurality of 
composite signals obtained by suitably combining said 
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individual collected audio signals, thereby performing echo 
cancellation, and wherein, using a sum signal and a 
difference signal of said stereo audio signals as reference 
signals, individual transfer functions of said two or four 
5 audio transfer systems or a plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions are respectively derived, thereby to set 
corresponding filter characteristics. According to this 
invention, since the sum signal and the difference signal of 

10 the stereo audio signals have a low correlation therebetween, 
the transfer functions of the two or four audio transfer 
systems or their composite transfer functions are 
respectively derived using the sum signal and the difference 
signal as reference signals, so as to set the corresponding 

15 filter characteristics, thereby to enable echo cancellation. 
In accordance therewith, since the stereo signals can be 
reproduced from the loudspeakers with no or less processing, 
which induces deterioration, applied to the stereo signals, 
excellent reproduced tone quality can be achieved. Further, 

20 there is no or only a small delay in reproduced signals. 

Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. The calculation 
of respectively deriving the individual transfer functions of 
said two or four audio transfer systems or the plurality of 

25 composite transfer functions obtained by suitably combining 
said individual transfer functions, using the sum signal and 
the difference signal of said stereo audio signals as the 
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reference signals, may be, for example, a calculation of 
respectively deriving the individual transfer functions of 
said two or four audio transfer systems or the plurality of 
composite transfer functions obtained by suitably combining 
5 said individual transfer functions, based on a cross-spectrum 
calculation between the sum signal and the difference signal, 
and the individual collected audio signals of the microphones, 
or the plurality of composite signals obtained by suitably 
combining said individual collected audio signals. Further, 

10 the calculation of respectively deriving the individual 

transfer functions of said two or four audio transfer systems 
or the plurality of composite transfer functions obtained by 
suitably combining said individual transfer functions, based 
on said cross-spectrum calculation, may be, for example, a 

15 calculation of respectively deriving the individual transfer 
functions of said two or four audio transfer systems or the 
plurality of composite transfer functions obtained by 
suitably combining said individual transfer functions, by 
deriving cross spectra between the sum signal and the 

20 difference signal of said stereo audio signals and the 

individual collected audio signals of the microphones or the 
plurality of composite signals obtained by suitably combining 
said individual collected audio signals, and ensemble- 
averaging them in a predetermined time period per cross 

25 spectrum. 

[0008] 

A stereo echo cancel method of this invention is a 
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method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 
5 said microphones, individual transfer functions of said two 
or four audio transfer systems or a plurality of composite 
transfer functions obtained by suitably combining said 
individual transfer functions are estimated so as to set 
corresponding filter characteristics, respectively, echo 

10 cancel signals are respectively produced by giving said set 

filter characteristics to corresponding individual signals to 
be reproduced by said respective loudspeakers or a plurality 
of composite signals obtained by suitably combining said 
individual signals, and said echo cancel signals are 

15 subtracted from corresponding individual collected audio 
signals of said one or two microphones, or a plurality of 
composite signals obtained by suitably combining said 
individual collected audio signals, thereby performing echo 
cancellation, and wherein, using a sum signal and a 

20 difference signal of said stereo audio signals as reference 

signals, estimated errors of individual transfer functions of 
said two or four audio transfer systems or a plurality of 
composite transfer functions obtained by suitably combining 
said individual transfer functions are respectively derived, 

25 thereby to update corresponding filter characteristics to 

values that cancel said estimated errors. According to this 
invention, since the sum signal and the difference signal of 
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the stereo audio signals have a low correlation therebetween, 
the estimated errors of the transfer functions of the two or 
four audio transfer systems or their composite transfer 
functions are respectively derived using the sum signal and 
5 the difference signal as reference signals, so as to 

successively update the corresponding filter characteristics 
to the values that cancel the estimated errors, thereby to 
enable echo cancellation. In accordance therewith, since the 
stereo signals can be reproduced from the loudspeakers with 

10 no or less processing, which induces deterioration, applied 
to the stereo signals, excellent reproduced tone quality can 
be achieved. Further, there is no or only a small delay in 
reproduced signals. Thus, when applying to the 
teleconferencing system or the like, natural conversation can 

15 be conducted. The calculation of respectively deriving the 

estimated errors of the individual transfer functions of said 
two or four audio transfer systems or the plurality of 
composite transfer functions obtained by suitably combining 
said individual transfer functions, using the sum signal and 

20 the difference signal of said stereo audio signals as the 
reference signals, may be, for example, a calculation of 
respectively deriving the estimated errors of the individual 
transfer functions of said two or four audio transfer systems 
or the plurality of composite transfer functions obtained by 

25 suitably combining said individual transfer functions, based 
on a cross -spectrum calculation between the sum signal and 
the difference signal of said stereo audio signals and 
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respective echo cancel error signals obtained by subtracting 
the corresponding echo cancel signals from the individual 
collected audio signals of said one or two microphones, or 
the plurality of composite signals obtained by suitably 
5 combining said individual collected audio signals. The 

calculation of respectively deriving the estimated errors of 
the individual transfer functions of said two or four audio 
transfer systems or the plurality of composite transfer 
functions obtained by suitably combining said individual 

10 transfer functions, based on the cross-spectrum calculation 
between the sum signal and the difference signal of said 
stereo audio signals and said echo cancel error signals, may 
be, for example, a calculation of respectively deriving the 
estimated errors of the individual transfer functions of said 

15 two or four audio transfer systems or the plurality of 

composite transfer functions obtained by suitably combining 
said individual transfer functions, by deriving cross spectra 
between the sum signal and the difference signal of said 
stereo audio signals and said echo cancel error signals, and 

20 ensemble-averaging them in a predetermined time period per 
cross spectrum. Further, the correlation between the sum 
signal and the difference signal of said stereo audio signals 
is detected and, when a value of said correlation is no less 
than a prescribed value, updating of said filter 

25 characteristics is stopped, thereby to prevent the echo 
cancel error signals from unexpectedly increasing. 
[0009] 
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A stereo audio transmission method of this invention 
is such that, with respect to two spaces each forming said 
four audio transfer systems, any of the foregoing multi- 
channel echo cancel methods is carried out, so that the 
5 stereo audio signals, which have been echo-canceled by 

performing said method, are transmitted between said two 
spaces. In accordance therewith, the stereo audio 
transmission with reduced echo cancellation can be performed 
between two spots, which, for example, can be applied to the 
10 teleconferencing system or the like. 
[0010] 

In this invention, the plurality of composite 
signals obtained by suitably combining the individual signals 
to be reproduced by the respective loudspeakers, and the 
15 plurality of low-correlation composite signals used as 

reference signals {in this specification, "low-correlation 
composite signals'* is used as including the meaning of 
uncorrelated signals (uncorrelated composite signals) } may be, 
for example, common signals. 
20 [0011] 

[Inventions of Claims 9 to 21 and Inventions 
relating to such Inventions] 

A multi-channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 
25 with a plurality of loudspeakers and one or a plurality of 

microphones and forming a plurality of audio transfer systems 
in which multi-channel sounds (e.g. multi-channel stereo 
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sounds such as two-channel, four-channel, ...) inputted from 
an outside and reproduced by said respective loudspeakers and 
having a correlation with each other are collected by said 
microphones, transfer functions of said plurality of audio 
5 transfer systems are estimated so as to set corresponding 

filter characteristics, respectively, echo cancel signals are 
respectively produced by giving said filter characteristics 
to corresponding signals to be reproduced by said respective 
loudspeakers, and said echo cancel signals are subtracted 

10 from corresponding collected audio signals of said one or 
plurality of microphones, thereby performing echo 
cancellation, and wherein, using as reference signals a set 
of a plurality of low-correlation composite signals which 
correspond to signals obtained by suitably combining said 

15 multi-channel audio signals and which have a lower 

correlation with each other than that between said multi- 
channel audio signals (e.g. suitably combining said multi- 
channel audio signals to produce a plurality of low- 
correlation composite signals having a lower correlation with 

20 each other than that between said multi-channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 
a set of a plurality of low-correlation composite signals 
which correspond to signals obtained by suitably combining 

25 said multi-channel audio signals and which have a lower 

correlation with each other than that between said multi- 
channel audio signals and using the set of said plurality of 
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low-correlation composite signals as reference signals, or 
the like) , transfer functions of the respective audio 
transfer systems are respectively derived, thereby to set 
corresponding filter characteristics. According to this 
5 invention, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 
obtained by suitably combining multi -channel audio signals 
having a correlation therebetween and which have a lower 
correlation with each other than that between such multi - 

10 channel audio signals, the transfer functions of the 

respective audio transfer systems are respectively derived, 
and corresponding filter characteristics are set, thereby to 
enable echo cancellation. In accordance therewith, since the 
multi-channel audio signals can be reproduced from the 

15 loudspeakers with no or less processing, which induces 

deterioration, applied to the multi-channel audio signals, 
excellent reproduced tone quality can be achieved. Further, 
there is no or only a small delay in reproduced signals. 
Thus, when applying to the teleconferencing system or the 

20 like, natural conversation can be conducted. The calculation 
of respectively deriving the transfer functions of the 
respective audio transfer systems using as the reference 
signals the set of the plurality of low-correlation composite 
signals, may be, for example, a calculation of respectively 

25 deriving the transfer functions of the respective audio 
transfer systems based on a cross -spectrum calculation 
between the plurality of low-correlation composite signals 
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and the respective microphone collected audio signals. 
Further, the calculation of respectively deriving the 
transfer functions of said plurality of audio transfer 
systems based on said cross-spectrum calculation, may be, for 
5 example, a calculation of combining said multi-channel audio 
signals through addition or subtraction to produce a 
plurality of low-correlation composite signals having a lower 
correlation with each other than that between said multi- 
channel audio signals, deriving cross spectra between said 

10 plurality of low-correlation composite signals and the 

respective microphone collected audio signals, and ensemble- 
averaging them in a predetermined time period per cross 
spectrum to derive a plurality of kinds of composite transfer 
functions obtained by combining transfer functions of a 

15 plurality of suitable systems among said plurality of audio 
transfer systems, thereby to derive transfer functions of 
said plurality of audio transfer systems based on said 
plurality of kinds of composite transfer functions. 
[0012] 

20 A multi-channel echo cancel method of this invention 

is a method wherein, with respect to a space provided therein 
with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 
in which multi -channel sounds inputted from an outside and 

2 5 reproduced by said respective loudspeakers and having a 

correlation with each other are collected by said microphones, 
transfer functions of said plurality of audio transfer 



- 21 - 



systems are estimated so as to set corresponding filter 
characteristics, respectively, echo cancel signals are 
respectively produced by giving said filter characteristics 
to corresponding signals to be reproduced by said respective 
5 loudspeakers, and said echo cancel signals are subtracted 
from corresponding collected audio signals of said one or 
plurality of microphones, thereby performing echo 
cancellation, and wherein, using as reference signals a set 
of a plurality of low-correlation composite signals which 

10 correspond to signals obtained by suitably combining said 
multi-channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals (e.g. suitably combining said multi- 
channel audio signals to produce a plurality of low- 

15 correlation composite signals having a lower correlation with 
each other than that between said multi-channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 
a set of a plurality of low-correlation composite signals 

20 which correspond to signals obtained by suitably combining 
said multi-channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals and using the set of said plurality of 
low-correlation composite signals as reference signals, or 

25 the like) , estimated errors of transfer functions of the 

respective audio transfer systems are respectively derived, 
thereby to update corresponding filter characteristics to 
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values that cancel said estimated errors. According to this 
invention, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 
obtained by suitably combining multi-channel audio signals 
5 having a correlation therebetween and which have a lower 
correlation with each other than that between such multi- 
channel audio signals, the estimated errors of the transfer 
functions of the respective audio transfer systems are 
respectively derived so as to successively update the 

10 corresponding filter characteristics to the values that 
cancel said estimated errors, .thereby to enable echo 
cancellation. In accordance therewith, since the multi- 
channel audio signals can be reproduced from the loudspeakers 
with no or less processing, which induces deterioration, 

15 applied to the multi-channel audio signals, excellent 

reproduced tone quality can be achieved. Further, there is 
no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. It is also possible to update 

20 the filter characteristics in real time. The filter 

characteristics can be updated, for example, per suitably 
determined prescribed time period (e.g. time period of 
performing said ensemble averaging) . The calculation of 
respectively deriving the estimated errors of the transfer 

25 functions of the respective audio transfer systems using the 
set of the plurality of low-correlation composite signals as 
the reference signals, may be, for example, a calculation of 
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respectively deriving the estimated errors of the transfer 
functions of the respective audio transfer systems based on a 
cross-spectrum calculation between said plurality of low- 
correlation composite signals and echo cancel error signals 
5 obtained by subtracting the corresponding echo cancel signals 
from the collected audio signals of said one or plurality of 
microphones. Further, the calculation of respectively 
deriving the estimated errors of the transfer functions of 
said plurality of audio transfer systems based on said cross- 

10 spectrum calculation, may be, for example, a calculation of 
combining said multi-channel audio signals through addition 
or subtraction to produce a plurality of low-correlation 
composite signals having a lower correlation with each other 
than that between said mult i -channel audio signals, deriving 

15 cross spectra between said plurality of low-correlation 

composite signals and the echo cancel error signals obtained 
by subtracting the corresponding echo cancel signals from the 
collected audio signals of said one or plurality of 
microphones, and ensemble -averaging them in a predetermined 

20 time period per cross spectrum to derive a plurality of kinds 
of transfer function composite estimated errors obtained by 
combining estimated errors of transfer functions of a 
plurality of suitable systems among said plurality of audio 
transfer systems, thereby to derive estimated errors of the 

25 transfer functions of said plurality of audio transfer 

systems based on said plurality of kinds of transfer function 
composite estimated errors. Further, the correlation between 
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said plurality of low-correlation composite signals is 
detected and, when a value of said correlation is no less 
than a prescribed value, updating of said filter 
characteristics is stopped, thereby to prevent the echo 
5 cancel error signals from unexpectedly increasing. 
[0013] 

Further, the calculation of respectively deriving 
the transfer functions of said plurality of audio transfer 
systems based on said cross-spectrum calculation, may be, for 

10 example, a calculation of producing a plurality of mutually 
orthogonal uncorrelated composite signals by applying a 
principal component analysis to said mult i -channel audio 
signals, deriving cross spectra between said plurality of 
uncorrelated composite signals and the respective microphone 

15 collected audio signals, and ensemble- averaging them in a 
predetermined time period per cross spectrum, thereby to 
derive the transfer functions of said plurality of audio 
transfer systems based on the ensemble -averaged values. 
[0014] 

20 Further, the calculation of respectively deriving 

the estimated errors of the transfer functions of said 
plurality of audio transfer systems based on said cross- 
spectrum calculation, may be, for example, a calculation of 
producing a plurality of mutually orthogonal uncorrelated 

25 composite signals by applying a principal component analysis 
to said mult i -channel audio signals, deriving cross spectra 
between said plurality of uncorrelated composite signals and 
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the echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the collected audio 
signals of said one or plurality of microphones, and 
ensemble-averaging them in a predetermined time period per 
5 cross spectrum, thereby to derive the estimated errors of the 
transfer functions of said plurality of audio transfer 
systems based on the ensemble-averaged values. In this case, 
it may be arranged that double talk in which sounds other 
than those reproduced by said loudspeakers are inputted into 

10 said microphones is detected and, when the double talk is 

- detected, an update period of said filter characteristics is 
made relatively longer, whereas, when the double talk is not 
detected, the update period of said filter characteristics is 
made relatively shorter, so that it is possible to fully 

15 converge the estimated errors when the double talk exists, 

and further, quicken the convergence of the estimated errors 
when there is no double talk. 
[0015] 

A multi-channel sound transfer method of this 
20 invention is such that, with respect to two spaces each 

forming said plurality of audio transfer systems, any of the 
foregoing multi-channel echo cancel methods is carried out, 
so that the multi -channel audio signals, which have been 
echo-canceled by performing said method, are transmitted 
25 between said two spaces. In accordance therewith, the multi- 
channel audio transmission with reduced echo cancellation can 
be performed between two spots, which, for example, can be 
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applied to the teleconferencing system or the like. 
[0016] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
5 two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 
said microphones, transfer functions of said two or four 
audio transfer systems are estimated so as to set 

10 corresponding filter characteristics, respectively, echo 
cancel signals are respectively produced by giving said 
filter characteristics to corresponding signals to be 
reproduced by said respective loudspeakers, and said echo 
cancel signals are subtracted from corresponding collected 

15 audio signals of said one or two microphones, thereby 

performing echo cancellation, and wherein, using a sum signal 
and a difference signal of said stereo audio signals as 
reference signals, transfer functions of said two or four 
audio transfer systems are respectively derived, thereby to 

20 set corresponding filter characteristics. According to this 
invention, since the sum signal and the difference signal of 
the stereo audio signals have a low correlation therebetween, 
the transfer functions of the two or four audio transfer 
systems are respectively derived using the sum signal and the 

25 difference signal as reference signals, so as to set the 

corresponding filter characteristics, thereby to enable echo 
cancellation. In accordance therewith, since the stereo 
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signals can be reproduced from the loudspeakers with no or 
less processing, which induces deterioration, applied to the 
stereo signals, excellent reproduced tone quality can be 
achieved. Further, there is no or only a small delay in 
5 reproduced signals. Thus, when applying to the 

teleconferencing system or the like, natural conversation can 
be conducted. The calculation of respectively deriving the 
transfer functions of said two or four audio transfer systems 
using the sum signal and the difference signal of said stereo 

10 audio signals as the reference signals, may be, for example, 
a calculation of respectively deriving the transfer functions 
of said two or four audio transfer systems based on a cross- 
spectrum calculation between the sum signal and the 
difference signal, and the respective microphone collected 

15 audio signals. Further, the calculation of respectively 
deriving the transfer functions of said two or four audio 
transfer systems based on said cross-spectrum calculation, 
may be, for example, a calculation of deriving cross spectra 
between the sum signal and the difference signal of said 

20 stereo audio signals and the respective microphone collected 
audio signals, and ensemble-averaging them in a predetermined 
time period per cross spectrum to derive a plurality of kinds 
of composite transfer functions obtained by combining 
transfer functions of a plurality of suitable systems among 

25 said two or four audio transfer systems, thereby to derive 

transfer functions of said two or four audio transfer systems 
based on said plurality of kinds of composite transfer 
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functions. Further, the cross -spectrum calculation in case 
of the four systems may calculate, for example, respective 
cross spectra between said sum signal and the first 
microphone collected audio signal, between said sum signal 
5 and the second microphone collected audio signal, between 
said difference signal and the first microphone collected 
audio signal, and between said difference signal and the 
second microphone collected audio signal. Further, said 
composite transfer functions may include, for example, the 

10 first composite transfer function that is the sum of a 

transfer function between the first loudspeaker and the first 
microphone and a transfer function between the second 
loudspeaker and the first microphone, the second composite 
transfer function that is a difference between the transfer 

15 function between the first loudspeaker and the first 

microphone and the transfer function between the second 
loudspeaker and the first microphone, the third composite 
transfer function that is the sum of a transfer function 
between the first loudspeaker and the second microphone and a 

20 transfer function between the second loudspeaker and the 

second microphone, and the fourth composite transfer function 
that is a difference between the transfer function between 
the first loudspeaker and the second microphone and the 
transfer function between the second loudspeaker and the 

25 second microphone. The calculation of deriving the transfer 
functions of said four audio transfer systems may include, 
for example, a calculation of deriving a transfer function of 
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the first audio transfer system from the sum of the first 
composite transfer function and the second composite transfer 
function, a calculation of deriving a transfer function of 
the second audio transfer system from a difference between 
5 the first composite transfer function and the second 

composite transfer function, a calculation of deriving a 
transfer function of the third audio transfer system from the 
sum of the third composite transfer function and the fourth 
composite transfer function, and a calculation of deriving a 
10 transfer function of the fourth audio transfer system from a 
difference between the third composite transfer function and 
the fourth composite transfer function. 
[0017] 

A stereo echo cancel method of this invention is a 
15 method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 
said microphones, transfer functions of said two or four 
20 audio transfer systems are estimated so as to set 

corresponding filter characteristics, respectively, echo 
cancel signals are respectively produced by giving said 
filter characteristics to corresponding signals to be 
reproduced by said respective loudspeakers, and said echo 
25 cancel signals are subtracted from corresponding collected 
audio signals of said one or two microphones, thereby 
performing echo cancellation, and wherein, using a sum signal 
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and a difference signal of said stereo audio signals as 
reference signals, estimated errors of transfer functions of 
said two or four audio transfer systems are respectively 
derived, thereby to update corresponding filter 
5 characteristics to values that cancel said estimated errors. 
According to this invention, since the sum signal and the 
difference signal of the stereo audio signals have a low 
correlation therebetween, the estimated errors of the 
transfer functions of the two or four audio transfer systems 

10 are respectively derived using the sum signal and the 

difference signal as reference signals, so as to successively 
update the corresponding filter characteristics to the values 
that cancel the estimated errors, thereby to enable echo 
cancellation. In accordance therewith, since the stereo 

15 signals can be reproduced from the loudspeakers with no or 

less processing, which induces deterioration, applied to the 
stereo signals, excellent reproduced tone quality can be 
achieved. Further, there is no or only a small delay in 
reproduced signals. Thus, when applying to the 

20 teleconferencing system or the like, natural conversation can 
be conducted. It is also possible to update the echo cancel 
coefficients (filter characteristics) in real time. The 
filter characteristics can be updated, for example, per 
suitably determined prescribed time period (e.g. time period 

25 of performing said ensemble averaging) . The calculation of 
respectively deriving the estimated errors of the transfer 
functions of said two or four audio transfer systems using 
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the sum signal and the difference signal of said stereo audio 
signals as the reference signals, may be, for example, a 
calculation of respectively deriving the estimated errors of 
the transfer functions of said two or four audio transfer 
5 systems based on a cross -spectrum calculation between the sum 
signal and the difference signal of said stereo audio signals 
and respective echo cancel error signals obtained by 
subtracting the corresponding echo cancel signals from the 
collected audio signals of said one or two microphones. 

10 Further, the calculation of respectively deriving the 

estimated errors of the transfer functions of said two or 
four audio transfer systems based on the cross-spectrum 
calculation between the sum signal and the difference signal 
of said stereo audio signals and said echo cancel error 

15 signals, may be, for example, a calculation of deriving cross 
spectra between the sum signal and the difference signal of 
said stereo audio signals and said echo cancel error signals, 
and ensemble-averaging them in a predetermined time period 
per cross spectrum to derive a plurality of kinds of transfer 

20 function composite estimated errors obtained by combining 
estimated errors of transfer functions of a plurality of 
suitable systems among said two or four audio transfer 
systems, thereby to derive estimated errors of the transfer 
functions of said two or four audio transfer systems based on 

25 said plurality of kinds of transfer function composite 

estimated errors. Further, the cross -spectrum calculation in 
case of the four systems may calculate, for example, 
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respective cross spectra between said sum signal and the 
first echo cancel error signal, between said sum signal and 
the second echo cancel error signal, between said difference 
signal and the first echo cancel error signal, and between 
5 said difference signal and the second echo cancel error 

signal. Further, said transfer function composite estimated 
errors may include, for example, the first transfer function 
composite estimated error that is the sum of an estimated 
error of a transfer function between the first loudspeaker 

10 and the first microphone and an estimated error of a transfer 
function between the second loudspeaker and the first 
microphone, the second transfer function composite estimated 
error that is a difference between the estimated error of the 
transfer function between the first loudspeaker and the first 

15 microphone and the estimated error of the transfer function 
between the second loudspeaker and the first microphone, the 
third transfer function composite estimated error that is the 
sum of an estimated error of a transfer function between the 
first loudspeaker and the second microphone and an estimated 

20 error of a transfer function between the second loudspeaker 
and the second microphone, and the fourth transfer function 
composite estimated error that is a difference between the 
estimated error of the transfer function between the first 
loudspeaker and the second microphone and the estimated error 

25 of the transfer function between the second loudspeaker and 
the second microphone. The calculation of deriving the 
estimated errors of the transfer functions of said four audio 
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transfer systems may include, for example, a calculation of 
deriving an estimated error of a transfer function of the 
first audio transfer system from the sum of the first 
transfer function composite estimated error and the second 
5 transfer function composite estimated error, a calculation of 
deriving an estimated error of a transfer function of the 
second audio transfer system from a difference between the 
first transfer function composite estimated error and the 
second transfer function composite estimated error, a 

10 calculation of deriving an estimated error of a transfer 

function of the third audio transfer system from the sum of 
the third transfer function composite estimated error and the 
fourth transfer function composite estimated error, and a 
calculation of deriving an estimated error of a transfer 

15 function of the fourth audio transfer system from a 

difference between the third transfer function composite 
estimated error and the fourth transfer function composite 
estimated error. Further, the correlation between the sum 
signal and the difference signal of said stereo audio signals 

20 is detected and, when a value of said correlation is no less 
than a prescribed value, updating of said filter 
characteristics is stopped, thereby to prevent the echo 
cancel error signals from unexpectedly increasing. 
[0018] 

25 A stereo echo cancel method of this invention is a 

method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
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or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 
said microphones, transfer functions of said two or four 
audio transfer systems are estimated so as to set 
5 corresponding filter characteristics, respectively, echo 
cancel signals are respectively produced by giving said 
filter characteristics to corresponding signals to be 
reproduced by said respective loudspeakers, and said echo 
cancel signals are subtracted from corresponding collected 

10 audio signals of said one or two microphones, thereby 
performing echo cancellation, and wherein a principal 
component analysis is applied to said stereo audio signals to 
produce two uncorrelated composite signals that are 
orthogonal to each other, and transfer functions of said two 

15 or four audio transfer systems are respectively derived using 
a set of said two uncorrelated composite signals as reference 
signals, thereby to set corresponding filter characteristics. 
According to this invention, since the mutually orthogonal 
two signals produced by applying the principal component 

20 analysis to the stereo audio signals are uncorrelated with 

each other, the transfer functions of said two or four audio 
transfer systems are respectively derived using such two 
signals, thereby to set the corresponding filter 
characteristics to enable echo cancellation. In accordance 

2 5 therewith, since the stereo signals can be reproduced from 
the loudspeakers with no or less processing, which induces 
deterioration, applied to the stereo signals, excellent 
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reproduced tone quality can be achieved. Further, there is 
no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. The calculation of 
5 respectively deriving the transfer functions of said two or 
four audio transfer systems using the set of said two 
uncorrelated composite signals as the reference signals may 
be, for example, a calculation of respectively deriving 
transfer functions of said two or four audio transfer systems 

10 based on a cross-spectrum calculation between said two 

uncorrelated composite signals and the respective microphone 
collected audio signals. Further, the calculation of 
respectively deriving the transfer functions of said two or 
four audio transfer systems based on said cross -spectrum 

15 calculation may be, for example, a calculation of deriving 

cross spectra between said two uncorrelated composite signals 
and the respective microphone collected audio signals, and 
ensemble-averaging them in a predetermined time period per 
cross spectrum derive a plurality of kinds of composite 

20 transfer functions obtained by combining transfer functions 
of a plurality of suitable systems among said two or four 
audio transfer systems, thereby to derive transfer functions 
of said two or four audio transfer systems based on said 
plurality of kinds of composite transfer functions. 

25 [0019] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
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two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 
said microphones, transfer functions of said two or four 
5 audio transfer systems are estimated so as to set 

corresponding filter characteristics, respectively, echo 
cancel signals are respectively produced by giving said 
filter characteristics to corresponding signals to be 
reproduced by said respective loudspeakers, and said echo 

10 cancel signals are subtracted from corresponding collected 
audio signals of said one or two microphones, thereby 
performing echo cancellation, and wherein a principal 
component analysis is applied to said stereo audio signals to 
produce two uncorrelated composite signals that are 

15 orthogonal to each other, and estimated errors of transfer 
functions of said two or four audio transfer systems are 
respectively derived using a set of said two uncorrelated 
composite signals as reference signals, thereby to update 
corresponding filter characteristics to values that cancel 

20 said estimated errors. According to this invention, since 

the mutually orthogonal two signals produced by applying the 
principal component analysis to the stereo audio signals are 
uncorrelated with each other, the estimated errors of the 
transfer functions of said two or four audio transfer systems 

25 are respectively derived using such two signals so as to 

successively update the corresponding filter characteristics 
to the values that cancel such estimated errors, thereby to 
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enable echo cancellation. In accordance therewith, since the 
stereo signals can be reproduced from the loudspeakers with 
no or less processing, which induces deterioration, applied 
to the stereo signals, excellent reproduced tone quality can 
5 be achieved. Further, there is no or only a small delay in 
reproduced signals. Thus, when applying to the 
teleconferencing system or the like, natural conversation can 
be conducted. It is also possible to update the echo cancel 
coefficients (filter characteristics) in real time. The 

10 filter characteristics can be updated, for example, per 

suitably determined prescribed time period (e.g. time period 
of performing said ensemble averaging) . The calculation of 
respectively deriving the estimated errors of the transfer 
functions of said two or four audio transfer systems using 

15 the set of said two uncorrelated composite signals as the 
reference signals may be, for example, a calculation of 
respectively deriving estimated errors of transfer functions 
of said two or four audio transfer systems based on a cross- 
spectrum calculation between said two uncorrelated composite 

20 signals and respective echo cancel error signals obtained by 
subtracting the corresponding echo cancel signals from the 
collected audio signals of said one or two microphones. 
Further, the calculation of respectively deriving the 
estimated errors of the transfer functions of said two or 

25 four audio transfer systems based on the cross -spectrum 

calculation between said two uncorrelated composite signals 
and said echo cancel error signals, may be, for example, a 
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calculation of deriving cross spectra between said two 
uncorrelated composite signals and said echo cancel error 
signals, and ensemble -averaging them in a predetermined time 
period per cross spectrum to derive a plurality of kinds of 
5 transfer function composite estimated errors obtained by 
combining estimated errors of transfer functions of a 
plurality of suitable systems among said two or four audio 
transfer systems, thereby to derive estimated errors of the 
transfer functions of said two or four audio transfer systems 

10 based on said plurality of kinds of transfer function 

composite estimated errors. In this case, it may be arranged 
that double talk in which sounds other than those reproduced 
by said loudspeakers are inputted into said microphones is 
detected and, when the double talk is detected, an update 

15 period of said filter characteristics is made relatively 

longer, whereas, when the double talk is not detected, the 
update period of said filter characteristics is made 
relatively shorter, so that it is possible to fully converge 
the estimated errors when the double talk exists, and further, 

2 0 quicken the convergence of the estimated errors when there is 
no double talk. 

[0020] 

A stereo audio transmission method of this invention 
is such that, with respect to two spaces each forming said 
25 four audio transfer systems, any of the foregoing multi- 
channel echo cancel methods is carried out, so that the 
stereo audio signals, which have been echo- canceled by 
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performing said method, are transmitted between said two 
spaces. In accordance therewith, the stereo audio 
transmission with reduced echo cancellation can be performed 
between two spots, which, for example, can be applied to the 
5 teleconferencing system or the like. 
[0021] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 
provided therein with two loudspeakers and two microphones 

10 and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 
collected by said respective microphones, an audio signal 
supplied to the first loudspeaker is subjected to convolution 
calculations by first and second filter means, respectively, 

15 which are provided corresponding to the first and second 

microphones, so as to produce first and second echo cancel 
signals, an audio signal supplied to the second loudspeaker 
is subjected to convolution calculations by third and fourth 
filter means, respectively, which are provided corresponding 

2 0 to the first and second microphones, so as to produce third 
and fourth echo cancel signals, echo cancellation is 
performed by subtracting, using first subtracting means, said 
first and third echo cancel signals from a collected audio 
signal of the first microphone, and echo cancellation is 

25 performed by subtracting, using second subtracting means, 

said second and fourth echo cancel signals from a collected 
audio signal of the second microphone, said stereo echo 
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canceller comprising: transfer function calculating means for 
respectively deriving filter characteristics corresponding to 
transfer functions of said four audio transfer systems based 
on a cross-spectrum calculation between a sum signal and a 
5 difference signal of stereo audio signals to be reproduced by 
said respective loudspeakers and the collected audio signals 
of said respective microphones, thereby to set said derived 
filter characteristics to corresponding ones of said first to 
fourth filter means, respectively. It may be arranged, for 

10 example, that the stereo echo canceller of this invention 
comprises input means for inputting said stereo audio 
signals; sum/difference signal producing means for producing 
a sum signal and a difference signal of the stereo audio 
signals inputted from said input means; and a main signal 

15 transmission system for transmitting the stereo audio signals 
inputted from said input means to said respective 
loudspeakers not through said sum/difference signal producing 
means, wherein said transfer function calculating means 
derives the filter characteristics corresponding to the 

20 transfer functions of said four audio transfer systems based 
on the cross -spectrum calculation between the sum signal and 
the difference signal produced by said sum/difference signal 
producing means and the respective microphone collected audio 
signals, and sets the derived filter characteristics to 

25 corresponding ones of said first to fourth filter means, 
respectively. Alternatively, it may be arranged that the 
stereo echo canceller comprises input means for inputting 



- 41 - 



said stereo audio signals; sum/difference signal producing 
means for producing a sum signal and a difference signal of 
the stereo audio signals inputted from said input means; and 
stereo audio signal demodulating means for calculating the 
5 sum of and a difference between the sum signal and the 

difference signal produced by said sum/difference signal 
producing means so as to recover the original stereo audio 
signals, wherein the stereo audio signals recovered by said 
stereo audio signal demodulating means is transmitted to said 

10 respective loudspeakers, and said transfer function 
calculating means derives the filter characteristics 
corresponding to the transfer functions of said four audio 
transfer systems based on the cross -spectrum calculation 
between the sum signal and the difference signal produced by 

15 said sum/difference signal producing means and the respective 
microphone collected audio signals, and sets the derived 
filter characteristics to corresponding ones of said first to 
fourth filter means, respectively. Alternatively, it may 
also be arranged that the stereo echo canceller comprises 

2 0 input means for inputting a sum signal and a difference 

signal of said stereo audio signals; and stereo audio signal 
demodulating means for calculating the sum of and a 
difference between said inputted sum signal and difference 
signal so as to recover the original stereo audio signals, 

25 wherein the stereo audio signals recovered by said stereo 
audio signal demodulating means is transmitted to said 
respective loudspeakers, and said transfer function 
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calculating means derives the filter characteristics 
corresponding to the transfer functions of said four audio 
transfer systems based on the cross-spectrum calculation 
between said inputted sum signal and difference signal and 
5 the respective microphone collected audio signals, and sets 
the derived filter characteristics to corresponding ones of 
said first to fourth filter means, respectively. 
[0022] 

A stereo echo canceller of this invention is a 

10 stereo echo canceller wherein, with respect to a space 

provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 
collected by said respective microphones, an audio signal 

15 supplied to the first loudspeaker is subjected to convolution 
calculations by first and second filter means, respectively, 
which are provided corresponding to the first and second 
microphones, so as to produce first and second echo cancel 
signals, an audio signal supplied to the second loudspeaker 

20 is subjected to convolution calculations by third and fourth 
filter means, respectively, which are provided corresponding 
to the first and second microphones, so as to produce third 
and fourth echo cancel signals, echo cancellation is 
performed by subtracting, using first subtracting means, said 

25 first and third echo cancel signals from a collected audio 
signal of the first microphone, and echo cancellation is 
performed by subtracting, using second subtracting means, 
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said second and fourth echo cancel signals from a collected 
audio signal of the second microphone, said stereo echo 
canceller comprising: transfer function calculating means for 
respectively deriving estimated errors of transfer functions 
5 of said four audio transfer systems based on a cross-spectrum 
calculation between a sum signal and a difference signal of 
stereo audio signals to be reproduced by said respective 
loudspeakers and respective echo cancel error signals 
obtained by subtracting the corresponding echo cancel signals 

10 from the collected audio signals of said two microphones, 
thereby to update filter characteristics of said first to 
fourth filter means to values that cancel said estimated 
errors, respectively. It may be arranged, for example, that 
the stereo echo canceller of this invention comprises input 

15 means for inputting said stereo audio signals; sum/difference 
signal producing means for producing a sum signal and a 
difference signal of the stereo audio signals inputted from 
said input means; and a main signal • transmission system for 
transmitting the stereo audio signals inputted from said 

20 input means to said respective loudspeakers not through said 
sum/difference signal producing means, wherein said transfer 
function calculating means derives the estimated errors of 
the transfer functions of said four audio transfer systems 
based on the cross -spectrum calculation between the sum 

25 signal and the difference signal produced by said 

sum/difference signal producing means and the respective echo 
cancel error signals, and updates the filter characteristics 
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of said first to fourth filter means to the values that 
cancel said estimated errors, respectively. Alternatively, 
it may be arranged that the stereo echo canceller comprises 
input means for inputting said stereo audio signals; 
5 sum/difference signal producing means for producing a sum 
signal and a difference signal of the stereo audio signals 
inputted from said input means; and stereo audio signal 
demodulating means for calculating the sum of and a 
difference between the sum signal and the difference signal 

10 produced by said sum/difference signal producing means so as 
to recover the original stereo audio signals, wherein the 
stereo audio signals recovered by said stereo audio signal 
demodulating means is transmitted to said respective 
loudspeakers, and said transfer function calculating means 

15 derives the estimated errors of the transfer functions of 

said four audio transfer systems based on the cross -spectrum 
calculation between the sum signal and the difference signal 
produced by said sum/difference signal producing means and 
the respective echo cancel error signals, and updates the 

20 filter characteristics of said first to fourth filter means 

to the values that cancel said estimated errors, respectively. 
Alternatively, it may also be arranged that the stereo echo 
canceller comprises input means for inputting a sum signal 
and a difference signal of said stereo audio signals; and 

25 stereo audio signal demodulating means for calculating the 

sum of and a difference between said inputted sum signal and 
difference signal so as to recover the original stereo audio 



- 45 - 



signals, wherein the stereo audio signals recovered by said 
stereo audio signal demodulating means is transmitted to said 
respective loudspeakers, and said transfer function 
calculating means derives the estimated errors of the 
5 transfer functions of said four audio transfer systems based 
on the cross-spectrum calculation between said inputted sum 
signal and difference signal produced by said sum/difference 
signal producing means and the respective echo cancel error 
signals, and updates the filter characteristics of said first 
10 to fourth filter means to the values that cancel said 
estimated errors, respectively. 
[0023] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 

15 provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced By said respective loudspeakers are 
collected by said respective microphones, an audio signal 
supplied to the first loudspeaker is subjected to convolution 

20 calculations by first and second filter means, respectively, 
which are provided corresponding to the first and second 
microphones, so as to produce first and second echo cancel 
signals, an audio signal supplied to the second loudspeaker 
is subjected to convolution calculations by third and fourth 

25 filter means, respectively, which are provided corresponding 
to the first and second microphones, so as to produce third 
and fourth echo cancel signals, echo cancellation is 
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performed by subtracting, using first subtracting means, said 
first and third echo cancel signals from a collected audio 
signal of the first microphone, and echo cancellation is 
performed by subtracting, using second subtracting means, 
5 said second and fourth echo cancel signals from a collected 
audio signal of the second microphone, said stereo echo 
canceller comprising: transfer function calculating means for 
respectively deriving filter characteristics corresponding to 
transfer functions of said four audio transfer systems based 

10 on a cross-spectrum calculation between mutually orthogonal 
two uncorrelated composite signals produced by applying a 
principal component analysis to stereo audio signals to be 
reproduced by said respective loudspeakers and the respective 
microphone collected audio signals, thereby to set said 

15 derived filter characteristics to corresponding ones of said 
first to fourth filter means, respectively. It may be 
arranged, for example, that the stereo echo canceller of this 
invention comprises input means for inputting said stereo 
audio signals; orthogonal i zing means for applying a principal 

20 component analysis to the stereo audio signals inputted from 
said input means to produce mutually orthogonal two 
uncorrelated composite signals ; and a main signal 
transmission system for transmitting the stereo audio signals 
inputted from said input means to said respective 

2 5 loudspeakers not through said orthogonal i zing means, wherein 
said transfer function calculating means derives the filter 
characteristics corresponding to the transfer functions of 
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said four audio transfer systems based on the cross-spectrum 
calculation between the two uncorrelated composite signals 
produced by said orthogonalizing means and the respective 
microphone collected audio signals, and sets the derived 
5 filter characteristics to corresponding ones of said first to 
fourth filter means, respectively. 
[0024] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 

10 provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 
collected by said respective microphones, an audio signal 
supplied to the first loudspeaker is subjected to convolution 

15 calculations by first and second filter means, respectively, 
which are provided corresponding to the first and second 
microphones, so as to produce first and second echo cancel 
signals, an audio signal supplied to the second loudspeaker 
is subjected to convolution calculations by third and fourth 

20 filter means, respectively, which are provided corresponding 
to the first and second microphones, so as to produce third 
and fourth echo cancel signals, echo cancellation is 
performed by subtracting, using first subtracting means, said 
first and third echo cancel signals from a collected audio 

25 signal of the first microphone, and echo cancellation is 
performed by subtracting, using second subtracting means, 
said second and fourth echo cancel signals from a collected 
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audio signal of the second microphone, said stereo echo 
canceller comprising: transfer function calculating means for 
respectively deriving estimated errors of transfer functions 
of said four audio transfer systems based on a cross -spectrum 
5 calculation between mutually orthogonal two uncorrelated 

composite signals produced by applying a principal component 
analysis to stereo audio signals to be reproduced by said 
respective loudspeakers and respective echo cancel error 
signals obtained by subtracting the corresponding echo cancel 

10 signals from the collected audio signals of said two 

microphones, thereby to update filter characteristics of said 
first to fourth filter means to values that cancel said 
estimated errors, respectively. It may be arranged, for 
example, that the stereo echo canceller of this invention 

15 comprises input means for inputting said stereo audio 

signals; orthogonal i zing means for applying a principal 
component analysis to the stereo audio signals inputted from 
said input means to produce mutually orthogonal two 
uncorrelated composite signals; and a main signal 

20 transmission system for transmitting the stereo audio signals 
inputted from said input means to said respective 
loudspeakers not through said orthogonalizing means, wherein 
said transfer function calculating means derives the 
estimated errors of the transfer functions of said four audio 

25 transfer systems based on the cross -spectrum calculation 

between the two uncorrelated composite signals produced by 
said orthogonalizing means and the respective echo cancel 
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error signals, and updates the filter characteristics of said 
first to fourth filter means to the values that cancel said 
estimated errors, respectively. In this case, it may be 
arranged that double talk detecting means is provided for 
5 detecting double talk in which sounds other than those 
reproduced by said loudspeakers are inputted into said 
microphones and, when the double talk is detected, said 
transfer function calculating means makes relatively longer 
an update period of said filter characteristics, whereas, 

10 when the double talk is not detected, it makes relatively 

shorter the update period of said filter characteristics, so 
that it is possible to fully converge the estimated errors 
when the double talk exists, and further, quicken the 
convergence of the estimated errors when there is no double 

15 talk. 

[0025] 

The stereo echo canceller of this invention may be 
further provided with correlation detecting means for 
detecting the correlation between the sum signal and the 

20 difference signal of said stereo audio signals and, when a 

value of said correlation is no less than a prescribed value, 
stopping updating of said filter characteristics, thereby to 
prevent the echo cancel error signals from unexpectedly 
increasing . 

25 [0026] 

A stereo sound transfer apparatus of this invention 
is such that, with respect to two spaces each forming said 
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four audio transfer systems, any of said stereo echo 
cancellers is arranged in each space, so that the stereo 
audio signals, which have been echo-canceled by said stereo 
echo cancellers, are transmitted between said two spaces. 
5 [0027] 

[Inventions of Claims 22 to 27 and Inventions 
relating to such Inventions] 

A multi-channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 

10 with a plurality of loudspeakers and one or a plurality of 

microphones and forming a plurality of audio transfer systems 
in which multi-channel sounds reproduced by said respective 
loudspeakers and having a correlation with each other are 
collected by said microphones, composite transfer functions 

15 of said plurality of audio transfer systems are estimated so 
as to set corresponding filter characteristics, respectively, 
echo cancel signals are respectively produced by giving said 
set filter characteristics to composite signals of individual 
signals to be reproduced by said respective loudspeakers, and 

2 0 said echo cancel signals are subtracted from individual 
collected audio signals of said one or plurality of 
microphones, thereby performing echo cancellation, and 
wherein, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 

25 obtained by suitably combining said multi-channel audio 

signals and which have a lower correlation with each other 
than that between said multi -channel audio signals (e.g. 
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suitably combining said multi -channel audio signals to 
produce a plurality of low-correlation composite signals 
having a lower correlation with each other than that between 
said multi-channel audio signals and using a set of said 
5 plurality of low-correlation composite signals as reference 
signals, or directly inputting a set of a plurality of low- 
correlation composite signals which correspond to signals 
obtained by suitably combining said multi-channel audio 
signals and which have a lower correlation with each other 

10 than that between said multi-channel audio signals and using 
the set of said plurality of low-correlation composite 
signals as reference signals, or the like) , composite 
transfer functions of said plurality of audio transfer 
systems are respectively derived, thereby to set 

15 corresponding filter characteristics. According to this 

invention, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 
obtained by suitably combining multi -channel audio signals 
having a correlation therebetween and which have a lower 

20 correlation with each other than that between such multi- 
channel audio signals, the composite transfer functions of 
said plurality of audio transfer systems are respectively 
derived, and corresponding filter characteristics are set, 
thereby to enable echo cancellation. In accordance therewith, 

25 since the multi -channel audio signals can be reproduced from 
the loudspeakers with no or less processing, which induces 
deterioration, applied to the multi-channel audio signals, 
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excellent reproduced tone quality can be achieved. Further, 
there is no or only a small delay in reproduced signals. 
Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. The calculation 
5 of respectively deriving the composite transfer functions of 
said plurality of audio transfer systems using as the 
reference signals the set of the plurality of low-correlation 
composite signals, may be, for example, a calculation of 
respectively deriving the composite transfer functions of 

10 said plurality of audio transfer systems based on a cross- 
spectrum calculation between the plurality of low-correlation 
composite signals and the individual collected audio signals 
of the respective microphones. Further, the calculation of 
respectively deriving the composite transfer functions of 

15 said plurality of audio transfer systems based on said cross- 
spectrum calculation, may be, for example, a calculation of 
combining said mult i -channel audio signals through addition 
or subtraction to produce a plurality of low-correlation 
composite signals having a lower correlation with each other 

2 0 than that between said multi -channel audio signals, deriving 
cross spectra between said plurality of low-correlation 
composite signals and the individual collected audio signals 
of the respective microphones, and ensemble-averaging them in 
a predetermined time period per cross spectrum to derive 

25 composite transfer functions of said plurality of audio 
transfer systems. 
[0028] 
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A multi- channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 
with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 
5 in which multi -channel sounds reproduced by said respective 
loudspeakers and having a correlation with each other are 
collected by said microphones, composite transfer functions 
of said plurality of audio transfer systems are estimated so 
as to set corresponding filter characteristics, respectively, 

10 echo cancel signals are respectively produced by giving said 
set filter characteristics to composite signals of individual 
signals to be reproduced by said respective loudspeakers, and 
said echo cancel signals are subtracted from individual 
collected audio signals of said one or plurality of 

15 microphones, thereby performing echo cancellation, and 

wherein, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 
obtained by suitably combining said multi -channel audio 
signals and which have a lower correlation with each other 

20 than that between said multi-channel audio signals (e.g. 
suitably combining said multi-channel audio signals to 
produce a plurality of low-correlation composite signals 
having a lower correlation with each other than that between 
said multi-channel audio signals and using a set of said 

25 plurality of low-correlation composite signals as reference 
signals, or directly inputting a set of a plurality of low- 
correlation composite signals which correspond to signals 
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obtained by suitably combining said multi -channel audio 
signals and which have a lower correlation with each other 
than that between said multi -channel audio signals and using 
the set of said plurality of low-correlation composite 
5 signals as reference signals, or the like) , estimated errors 
of composite transfer functions of said plurality of audio 
transfer systems are respectively derived, thereby to update 
corresponding filter characteristics to values that cancel 
said estimated errors. According to this invention, using as 

10 reference signals a set of a plurality of low-correlation 
composite signals which correspond to signals obtained by 
suitably combining multi-channel audio signals having a 
correlation therebetween and which have a lower correlation 
with each other than that between such multi-channel audio 

15 signals, the estimated errors of the composite transfer 

functions of said plurality of audio transfer systems are 
respectively derived so as to successively update the 
corresponding filter characteristics to the values that 
cancel said estimated errors, thereby to enable echo 

20 cancellation. In accordance therewith, since the multi- 
channel audio signals can be reproduced from the loudspeakers 
with no or less processing, which induces deterioration, 
applied to the multi-channel audio signals, excellent 
reproduced tone quality can be achieved. Further, there is 

25 no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. It is also possible to update 
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the filter characteristics in real time. The calculation of 
respectively deriving the estimated errors of the composite 
transfer functions of said plurality of audio transfer 
systems using the set of the plurality of low-correlation 
5 composite signals as the reference signals, may be, for 

example, a calculation of respectively deriving the estimated 
errors of the composite transfer functions of said plurality 
of audio transfer systems based on a cross -spectrum 
calculation between said plurality of low-correlation 

10 composite signals and echo cancel error signals obtained by 
subtracting the corresponding echo cancel signals from the 
individual collected audio signals of said one or plurality 
of microphones. Further, the calculation of respectively 
deriving the estimated errors of the composite transfer 

15 functions of said plurality of audio transfer systems based 
on said cross-spectrum calculation, may be, for example, a 
calculation of combining said multi-channel audio signals 
through addition or subtraction to produce a plurality of 
low-correlation composite signals having a lower correlation 

20 with each other than that between said multi-channel audio 
signals, deriving cross spectra between said plurality of 
low-correlation composite signals and the echo cancel error 
signals obtained by subtracting the corresponding echo cancel 
signals from the individual collected audio signals of said 

25 one or plurality of microphones, and ensemble-averaging them 
in a predetermined time period per cross spectrum to derive 
estimated errors of the composite transfer functions of said 
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plurality of audio transfer systems. Further, the 
correlation between said plurality of low-correlation 
composite signals is detected and, when a value of said 
correlation is no less than a prescribed value, updating of 
5 said filter characteristics is stopped, thereby to prevent 
the echo cancel error signals from unexpectedly increasing. 
[0029] 

A multi-channel sound transfer method of this 
invention is such that, with respect to two spaces each 

10 forming said plurality of audio transfer systems, any of the 
foregoing mult i -channel echo cancel methods is carried out, 
so that the mult i -channel audio signals, which have been 
echo-canceled by performing said method, are transmitted 
between said two spaces. In accordance therewith, the multi- 

15 channel audio transmission with reduced echo cancellation can 
be performed between two spots, which, for example, can be 
applied to the teleconferencing system or the like. 
[0030] 

A stereo echo cancel method of this invention is a 
20 .method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 
reproduced by said respective loudspeakers are collected by 
said microphones, composite transfer functions of said two or 
25 four audio transfer systems are estimated so as to set 

corresponding filter characteristics, respectively, echo 
cancel signals are respectively produced by giving said set 
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filter characteristics to composite signals of individual 
signals to be reproduced by said respective loudspeakers, and 
said echo cancel signals are subtracted from individual 
collected audio signals of said one or two microphones, 
5 thereby performing echo cancellation, and wherein, using a 
sum signal and a difference signal of said stereo audio 
signals as reference signals, composite transfer functions of 
said two or four audio transfer systems are respectively 
derived, thereby to set corresponding filter characteristics. 

10 According to this invention, since the sum signal and the 
difference signal of the stereo audio signals have a low 
correlation therebetween, the composite transfer functions of 
the two or four audio transfer systems are respectively 
derived using the sum signal and the difference signal as 

15 reference signals, so as to set the corresponding filter 
characteristics, thereby to enable echo cancellation. In 
accordance therewith, since the stereo signals can be 
reproduced from the loudspeakers with no or less processing, 
which induces deterioration, applied to the stereo signals, 

20 excellent reproduced tone quality can be achieved. Further, 
there is no or only a small delay in reproduced signals. 
Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. The calculation 
of respectively deriving the composite transfer functions of 

25 said two or four audio transfer systems using the sum signal 
and the difference signal of said stereo audio signals as the 
reference signals, may be, for example, a calculation of 
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respectively deriving the composite transfer functions of 
said two or four audio transfer systems based on a cross- 
spectrum calculation between the sum signal and the 
difference signal , and the individual collected audio signals 
5 of the respective microphones. Further, the calculation of 
respectively deriving the composite transfer functions of 
said two or four audio transfer systems based on said cross- 
spectrum calculation, may be, for example, a calculation of 
deriving cross spectra between the sum signal and the 

10 difference signal of said stereo audio signals and the 
individual collected audio signals of the respective 
microphones, and ensemble-averaging them in a predetermined 
time period per cross spectrum to derive composite transfer 
functions of said two or four audio transfer systems. 

15 [0031] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
or four audio transfer systems in which stereo sounds 

2 0 reproduced by said respective loudspeakers are collected by 

said microphones, composite transfer functions of said two or 
four audio transfer systems are estimated so as to set 
corresponding filter characteristics, respectively, echo 
cancel signals are respectively produced by giving said set 

25 filter characteristics to composite signals of individual 

signals to be reproduced by said respective loudspeakers, and 
said echo cancel signals are subtracted from individual 
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collected audio signals of said one or two microphones, 
thereby performing echo cancellation, and wherein, using a 
sum signal and a difference signal of said stereo audio 
signals as reference signals, estimated errors of composite 
5 transfer functions of said two or four audio transfer systems 
are respectively derived, thereby to update corresponding 
filter characteristics to values that cancel said estimated 
errors. According to this invention, since the sum signal 
and the difference signal of the stereo audio signals have a 

10 low correlation therebetween, the estimated errors of the 
composite transfer functions of the two or four audio 
transfer systems are respectively derived using the sum 
signal and the difference signal as reference signals, so as 
to successively update the corresponding filter 

15 characteristics to the values that cancel the estimated 

errors, thereby to enable echo cancellation. In accordance 
therewith, since the stereo signals can be reproduced from 
the loudspeakers with no or less processing, which induces 
deterioration, applied to the stereo signals, excellent 

2 0 reproduced tone quality can be achieved. Further, there is 
no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. The calculation of 
respectively deriving the estimated errors of the composite 

25 transfer functions of said two or four audio transfer systems 
using the sum signal and the difference signal of said stereo 
audio signals as the reference signals, may be, for example, 
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a calculation of respectively deriving the estimated errors 
of the composite transfer functions of said two or four audio 
transfer systems based on a cross -spectrum calculation 
between the sum signal and the difference signal of said 
5 stereo audio signals and respective echo cancel error signals 
obtained by subtracting the corresponding echo cancel signals 
from the individual collected audio signals of said one or 
two microphones. Further, the calculation of respectively 
deriving the estimated errors of the composite transfer 

10 functions of said two or four audio transfer systems based on 
the cross -spectrum calculation between the sum signal and the 
difference signal of said stereo audio signals and said echo 
cancel error signals, may be, for example, a calculation of 
deriving cross spectra between the sum signal and the 

15 difference signal of said stereo audio signals and said echo 
cancel error signals, and ensemble -averaging them in a 
predetermined time period per cross spectrum to derive 
estimated errors of the composite transfer functions of said 
two or four audio transfer systems. Further, the correlation 

20 between the sum signal and the difference signal of said 

stereo audio signals is detected and, when a value of said 
correlation is no less than a prescribed value, updating of 
said filter characteristics is stopped, thereby to prevent 
the echo cancel error signals from unexpectedly increasing. 

25 [0032] 

A stereo audio transmission method of this invention 
is such that, with respect to two spaces each forming said 
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four audio transfer systems, any of the foregoing multi- 
channel echo cancel methods is carried out, so that the 
stereo audio signals, which have been echo-canceled by 
performing said method, are transmitted between said two 
5 spaces. In accordance therewith, the stereo audio 

transmission with reduced echo cancellation can be performed 
between two spots, which, for example, can be applied to the 
teleconferencing system or the like. 
[0033] 

10 A stereo echo canceller of this invention is a 

stereo echo canceller wherein, with respect to a space 
provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 

15 collected by said respective microphones, a sum signal of 
stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 
first and second filter means, respectively, so as to produce 
first and second echo cancel signals, a difference signal of 

20 the stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 
third and fourth filter means, respectively, so as to produce 
third and fourth echo cancel signals, echo cancellation is 
performed by subtracting, using first subtracting means, said 

25 first and third echo cancel signals from a collected audio 
signal of the first microphone, and echo cancellation is 
performed by subtracting, using second subtracting means, 
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said second and fourth echo cancel signals from a collected 
audio signal of the second microphone, said stereo echo 
canceller comprising: transfer function calculating means for 
respectively deriving filter characteristics corresponding to 
5 composite transfer functions of said four audio transfer 

systems based on a cross -spectrum calculation between the sum 
signal and the difference signal of the stereo audio signals 
to be reproduced by said respective loudspeakers and the 
respective microphone collected audio signals, thereby to set 

10 said derived filter characteristics to corresponding ones of 
said first to fourth filter means, respectively. It may be 
arranged, for example, that the stereo echo canceller of this 
invention comprises input means for inputting said stereo 
audio signals; sum/difference signal producing means for 

15 producing a sum signal and a difference signal of the stereo 
audio signals inputted from said input means; and a main 
signal transmission system for transmitting the stereo audio 
signals inputted from said input means to said respective 
loudspeakers not through said sum/difference signal producing 

20 means, wherein said transfer function calculating means 
derives the filter characteristics corresponding to the 
composite transfer functions of said four audio transfer 
systems based on the cross -spectrum calculation between the 
sum signal and the difference signal produced by said 

25 sum/difference signal producing means and the respective 
microphone collected audio signals, and sets the derived 
filter characteristics to corresponding ones of said first to 
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fourth filter means, respectively. 
[0034] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 
5 provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 
collected by said respective microphones, a sum signal of 
stereo audio signals to be reproduced by said respective 

10 loudspeakers is subjected to convolution calculations by 

first and second filter means, respectively, so as to produce 
first and second echo cancel signals, a difference signal of 
the stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 

15 third and fourth filter means, respectively, so as to produce 
third and fourth echo cancel signals, echo cancellation is 
performed by subtracting, using first subtracting means, said 
first and third echo cancel signals from a collected audio 
signal of the first microphone, and echo cancellation is 

20 performed by subtracting, using second subtracting means, 

said second and fourth echo cancel signals from a collected 
audio signal of the second microphone, said stereo echo 
canceller comprising: transfer function calculating means for 
respectively deriving estimated errors of composite transfer 

25 functions of said four audio transfer systems based on a 
cross-spectrum calculation between the sum signal and the 
difference signal of the stereo audio signals to be 
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reproduced by said respective loudspeakers and respective 
echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the collected audio 
signals of said two microphones, thereby to update filter 
5 characteristics of said first to fourth filter means to 

values that cancel said estimated errors, respectively. It 
may be arranged, for example, that the stereo echo canceller 
of this invention comprises input means for inputting said 
stereo audio signals; sum/difference signal producing means 

10 for producing a sum signal and a difference signal of the 
stereo audio signals inputted from said input means; and a 
main signal transmission system for transmitting the stereo 
audio signals inputted from said input means to said 
respective loudspeakers not through said sum/difference 

15 signal producing means, wherein said transfer function 
calculating means derives the estimated errors of the 
composite transfer functions of said four audio transfer 
systems based on the cross-spectrum calculation between the 
sum signal and the difference signal produced by said 

20 sum/difference signal producing means and the respective echo 
cancel error signals, and updates the filter characteristics 
of said first to fourth filter means to the values that 
cancel said estimated errors, respectively. 
[0035] 

25 The stereo echo canceller of this invention may be 

further provided with correlation detecting means for 
detecting the correlation between the sum signal and the 
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difference signal of said stereo audio signals and, when a 
value of said correlation is no less than a prescribed value, 
stopping updating of said filter characteristics, thereby to 
prevent the echo cancel error signals from unexpectedly 
5 increasing. 

[0036] 

A stereo sound transfer apparatus of this invention 
is such that, with respect to two spaces each forming said 
four audio transfer systems, any of said stereo echo 
10 cancellers is arranged in each space, so that the stereo 

audio signals, which have been echo-canceled by said stereo 
echo cancellers, are transmitted between said two spaces. 
[0037] 

[Inventions of Claims 28 to 31 and Inventions 

15 relating to such Inventions] 

A multi-channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 
with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 

20 in which multi-channel sounds reproduced by said respective 
loudspeakers and having a correlation with each other are 
collected by said microphones, composite transfer functions 
of said plurality of audio transfer systems are estimated so 
as to set corresponding filter characteristics, respectively, 

25 echo cancel signals are respectively produced by giving said 
set filter characteristics to individual signals to be 
reproduced by said respective loudspeakers, and said echo 
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cancel signals are subtracted from composite signals of 
individual collected audio signals of said one or plurality 
of microphones, thereby performing echo cancellation, and 
wherein, using as reference signals a set of a plurality of 
5 low-correlation composite signals which correspond to signals 
obtained by suitably combining said multi-channel audio 
signals and which have a lower correlation with each other 
than that between said multi-channel audio signals (e.g. 
suitably combining said multi -channel audio signals to 

10 produce a plurality of low-correlation composite signals 

having a lower correlation with each other than that between 
said multi -channel audio signals and using a set of said 
plurality of low-correlation composite signals as reference 
signals, or directly inputting a set of a plurality of low- 

15 correlation composite signals which correspond to signals 
obtained by suitably combining said multi-channel audio 
signals and which have a lower correlation with each other 
than that between said multi-channel audio signals and using 
the set of said plurality of low-correlation composite 

20 signals as reference signals, or the like) , composite 
transfer functions of said plurality of audio transfer 
systems are respectively derived, thereby to set 
corresponding filter characteristics. According to this 
invention, using as reference signals a set of a plurality of 

25 low-correlation composite signals which correspond to signals 
obtained by suitably combining multi -channel audio signals 
having a correlation therebetween and which have a lower 
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correlation with each other than that between such multi- 
channel audio signals, the composite transfer functions of 
said plurality of audio transfer systems are respectively 
derived, and corresponding filter characteristics are set, 
5 thereby to enable echo cancellation. In accordance therewith, 
since the multi- channel audio signals can be reproduced from 
the loudspeakers with no or less processing, which induces 
deterioration, applied to the multi-channel audio signals, 
excellent reproduced tone quality can be achieved. Further, 

10 there is no or only a small delay in reproduced signals. 

Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. The calculation 
of respectively deriving the composite transfer functions of 
said plurality of audio transfer systems using as the 

15 reference signals the set of the plurality of low-correlation 
composite signals, may be, for example, a calculation of 
respectively deriving the composite transfer functions of 
said plurality of audio transfer systems based on a cross- 
spectrum calculation between the plurality of low-correlation 

2 0 composite signals and the composite signals of the individual 
collected audio signals of the respective microphones. 
Further, the calculation of respectively deriving the 
composite transfer functions of said plurality of audio 
transfer systems based on said cross-spectrum calculation, 

25 may be, for example, a calculation of combining said multi- 
channel audio signals through addition or subtraction to 
produce a plurality of low-correlation composite signals 



having a lower correlation with each other than that between 
said multi- channel audio signals, deriving cross spectra 
between said plurality of low-correlation composite signals 
and the composite signals of the individual collected audio 
5 signals of the respective microphones, and ensemble-averaging 
them in a predetermined time period per cross spectrum to 
derive composite transfer functions of said plurality of 
audio transfer systems. 
[0038] 

10 A multi-channel echo cancel method of this invention 

is a method wherein, with respect to a space provided therein 
with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 
in which multi-channel sounds reproduced by said respective 

15 loudspeakers and having a correlation with each other are 

collected by said microphones, composite transfer functions 
of said plurality of audio transfer systems are estimated so 
as to set corresponding filter characteristics, respectively, 
echo cancel signals are respectively produced by giving said 

20 set filter characteristics to individual signals to be 

reproduced by said respective loudspeakers, and said echo 
cancel signals are subtracted from composite signals of 
individual collected audio signals of said one or plurality 
of microphones, thereby performing echo cancellation, and 

25 wherein, using as reference signals a set of a plurality of 

low- correlation composite signals which correspond to signals 
obtained by suitably combining said multi -channel audio 
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signals and which have a lower correlation with each other 
than that between said multi- channel audio signals (e.g. 
suitably combining said multi -channel audio signals to 
produce a plurality of low-correlation composite signals 
5 having a lower correlation with each other than that between 
said multi-channel audio signals and using a set of said 
plurality of low-correlation composite signals as reference 
signals, or directly inputting a set of a plurality of low- 
correlation composite signals which correspond to signals 

10 obtained by suitably combining said multi-channel audio 

signals and which have a lower correlation with each other 
than that between said multi-channel audio signals and using 
the set of said plurality of low-correlation composite 
signals as reference signals, or the like) , estimated errors 

15 of composite transfer functions of said plurality of audio 

transfer systems are respectively derived, thereby to update 
corresponding filter characteristics to values that cancel 
said estimated errors. According to this invention, using as 
reference signals a set of a plurality of low-correlation 

2 0 composite signals which correspond to signals obtained by 
suitably combining multi -channel audio signals having a 
correlation therebetween and which have a lower correlation 
with each other than that between such multi-channel audio 
signals, the estimated errors of the composite transfer 

25 functions of said plurality of audio transfer systems are 
respectively derived so as to successively update the 
corresponding filter characteristics to the values that 
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cancel said estimated errors, thereby to enable echo 
cancellation. In accordance therewith, since the multi- 
channel audio signals can be reproduced from the loudspeakers 
with no or less processing, which induces deterioration, 
5 applied to the mult i -channel audio signals, excellent 

reproduced tone quality can be achieved. Further, there is 
no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. It is also possible to update 

10 the filter characteristics in real time. The calculation of 
respectively deriving the estimated errors of the composite 
transfer functions of said plurality of audio transfer 
systems using the set of the plurality of low-correlation 
composite signals as the reference signals, may be, for 

15 example, a calculation of respectively deriving the estimated 
errors of the composite transfer functions of said plurality 
of audio transfer systems based on a cross -spectrum 
calculation between said plurality of low-correlation 
composite signals and echo cancel error signals obtained by 

20 subtracting the corresponding echo cancel signals from the 
composite signals of the individual collected audio signals 
of said one or plurality of microphones. Further, the 
calculation of respectively deriving the estimated errors of 
the composite transfer functions of said plurality of audio 

25 transfer systems based on said cross -spectrum calculation, 
may be, for example, a calculation of combining said multi- 
channel audio signals through addition or subtraction to 
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produce a plurality of low-correlation composite signals 
having a lower correlation with each other than that between 
said multi-channel audio signals, deriving cross spectra 
between said plurality of low-correlation composite signals 
5 and the echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the composite signals 
of the individual collected audio signals of said one or 
plurality of microphones, and ensemble -averaging them in a 
predetermined time period per cross spectrum to derive 

10 estimated errors of the composite transfer functions of said 
plurality of audio transfer systems. Further, the 
correlation between said plurality of low-correlation 
composite signals is detected and, when a value of said 
correlation is no less than a prescribed value, updating of 

15 said filter characteristics is stopped, thereby to prevent 
the echo cancel error signals from unexpectedly increasing. 
[0039] 

A mult i -channel sound transfer method of this 
invention is such that, with respect to two spaces each 

20 forming said plurality of audio transfer systems, any of the 
foregoing multi-channel echo cancel methods is carried out, 
so that the multi-channel audio signals, which have been 
echo-canceled by performing said method, are transmitted 
between said two spaces. In accordance therewith, the multi- 

25 channel audio transmission with reduced echo cancellation can 
be performed between two spots, which, for example, can be 
applied to the teleconferencing system or the like. 
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[0040] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
5 or four audio transfer systems in which stereo sounds 

reproduced by said respective loudspeakers are collected by 
said microphones, composite transfer functions of said two or 
four audio transfer systems are estimated so as to set 
corresponding filter characteristics, respectively, echo 

10 cancel signals are respectively produced by giving said set 

filter characteristics to individual signals to be reproduced 
by said respective loudspeakers, and said echo cancel signals 
are subtracted from composite signals of individual collected 
audio signals of said one or two microphones, thereby 

15 performing echo cancellation, and wherein, using a sum signal 
and a difference signal of said stereo audio signals as 
reference signals, composite transfer functions of said two 
or four audio transfer systems are respectively derived, 
thereby to set corresponding filter characteristics. 

20 According to this invention, since the sum signal and the 
difference signal of the stereo audio signals have a low 
correlation therebetween, estimated errors of the composite 
transfer functions of the two or four audio transfer systems 
are respectively derived using the sum signal and the 

25 difference signal as reference signals, so as to successively 
update the corresponding filter characteristics to values 
that cancel said estimated errors, thereby to enable echo 
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cancellation. In accordance therewith, since the stereo 
signals can be reproduced from the loudspeakers with no or 
less processing, which induces deterioration, applied to the 
stereo signals, excellent reproduced tone quality can be 
5 achieved. Further, there is no or only a small delay in 
reproduced signals. Thus, when applying to the 
teleconferencing system or the like, natural conversation can 
be conducted. The calculation of respectively deriving the 
composite transfer functions of said two or four audio 

10 transfer systems using the sum signal and the difference 

signal of said stereo audio signals as the reference signals, 
may be, for example, a calculation of respectively deriving 
the composite transfer functions of said two or four audio 
transfer systems based on a cross- spectrum calculation 

15 between the sum signal and the difference signal, and the 

composite signals of the individual collected audio signals 
of the respective microphones. Further, the calculation of 
respectively deriving the composite transfer functions of 
said two or four audio transfer systems based on said cross - 

20 spectrum calculation, may be, for example, a calculation of 
deriving cross spectra between the sum signal and the 
difference signal of said stereo audio signals and the 
composite signals of the individual collected audio signals 
of the respective microphones, and ensemble -averaging them in 

25 a predetermined time period per cross spectrum to derive 
composite transfer functions of said two or four audio 
transfer systems. 
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[0041] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 
5 or four audio transfer systems in which stereo sounds 

reproduced by said respective loudspeakers are collected by 
said microphones, composite transfer functions of said two or 
four audio transfer systems are estimated so as to set 
corresponding filter characteristics, respectively, echo 

10 cancel signals are respectively produced by giving said set 

filter characteristics to individual signals to be reproduced 
by said respective loudspeakers, and said echo cancel signals 
are subtracted from composite signals of individual collected 
audio signals of said one or two microphones, thereby 

15 performing echo cancellation, and wherein, using a sum signal 
and a difference signal of said stereo audio signals as 
reference signals, estimated errors of composite transfer 
functions of said two or four audio transfer systems are 
respectively derived, thereby to update corresponding filter 

20 characteristics to values that cancel said estimated errors. 

According to this invention, using as reference signals a set 
of a plurality of low-correlation composite signals which 
correspond to signals obtained by suitably combining multi- 
channel audio signals having a correlation therebetween and 

25 which have a lower correlation with each other than that 
between such multi -channel audio signals, the estimated 
errors of the composite transfer functions of said plurality 
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of audio transfer systems are respectively derived, so as to 
successively update the corresponding filter characteristics 
to the values that cancel the estimated errors, thereby to 
enable echo cancellation. In accordance therewith, since the 
5 multi-channel audio signals can be reproduced from the 
loudspeakers with no or less processing, which induces 
deterioration, applied to the multi-channel audio signals, 
excellent reproduced tone quality can be achieved. Further, 
there is no or only a small delay in reproduced signals. 

10 Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. It is also 
possible to update the filter characteristics in real time. 
The calculation of respectively deriving the estimated errors 
of the composite transfer functions of said two or four audio 

15 transfer systems using the sum signal and the difference 

signal of said stereo audio signals as the reference signals, 
may be, for example, a calculation of respectively deriving 
the estimated errors of the composite transfer functions of 
said two or four audio transfer systems based on a cross - 

20 spectrum calculation between the sum signal and the 

difference signal of said stereo audio signals and respective 
echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the composite signals 
of the individual collected audio signals of said one or two 

25 microphones. Further, the calculation of respectively 
deriving the estimated errors of the composite transfer 
functions of said two or four audio transfer systems based on 
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the cross- spectrum calculation between the sum signal and the 
difference signal of said stereo audio signals and said echo 
cancel error signals, may be, for example, a calculation of 
deriving cross spectra between the sum signal and the 
5 difference signal of said stereo audio signals and said echo 
cancel error signals, and ensemble -averaging them in a 
predetermined time period per cross spectrum to derive 
estimated errors of the composite transfer functions of said 
two or four audio transfer systems. Further, the correlation 

10 between the sum signal and the difference signal of said 

stereo audio signals is detected and, when a value of said 
correlation is no less than a prescribed value, updating of 
said filter characteristics is stopped, thereby to prevent 
the echo cancel error signals from unexpectedly increasing. 

15 [0042] 

A stereo audio transmission method of this invention 
is such that, with respect to two spaces each forming said 
four audio transfer systems, any of the foregoing multi- 
channel echo cancel methods is carried out, so that the 

20 stereo audio signals, which have been echo-canceled by 

performing said method, are transmitted between said two 
spaces. In accordance therewith, the stereo audio 
transmission with reduced echo cancellation can be performed 
between two spots, which, for example, can be applied to the 

25 teleconferencing system or the like. 
[0043] 

A stereo echo canceller of this invention is a 
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stereo echo canceller wherein, with respect to a space 
provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 
5 collected by said respective microphones, an audio signal 

supplied to the first loudspeaker is subjected to convolution 
calculations by first and second filter means, respectively, 
so as to produce first and second echo cancel signals, an 
audio signal supplied to the second loudspeaker is subjected 

10 to convolution calculations by third and fourth filter means, 
respectively, so as to produce third and fourth echo cancel 
signals, echo cancellation is performed by subtracting, using 
first subtracting means, said first and third echo cancel 
signals from a sum signal of collected audio signals of the 

15 respective microphones, and echo cancellation is performed by 
subtracting, using second subtracting means, said second and 
fourth echo cancel signals from a difference signal of the 
collected audio signals of the respective microphones, said 
stereo echo canceller comprising: transfer function 

20 calculating means for respectively deriving filter 

characteristics corresponding to composite transfer functions 
of said four audio transfer systems based on a cross-spectrum 
calculation between a sum signal and a difference signal of 
stereo audio signals to be reproduced by said respective 

25 loudspeakers and the sum signal and the difference signal of 
the respective microphone collected audio signals, thereby to 
set said derived filter characteristics to corresponding ones 
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of said first to fourth filter means, respectively. 
[0044] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 
5 provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 
sounds reproduced by said respective loudspeakers are 
collected by said respective microphones, an audio signal 
supplied to the first loudspeaker is subjected to convolution 

10 calculations by first and second filter means, respectively, 
so as to produce first and second echo cancel signals, an 
audio signal supplied to the second loudspeaker is subjected 
to convolution calculations by third and fourth filter means, 
respectively, so as to produce third and fourth echo cancel 

15 signals, echo cancellation is performed by subtracting, using 
first subtracting means, said first and third echo cancel 
signals from a sum signal of collected audio signals of the 
respective microphones, and echo cancellation is performed by 
subtracting, using second subtracting means, said second and 

20 fourth echo cancel signals from a difference signal of the 
collected audio signals of the respective microphones, said 
stereo echo canceller comprising: transfer function 
calculating means for respectively deriving estimated errors 
of composite transfer functions of said four audio transfer 

25 systems based on a cross-spectrum calculation between a sum 
signal and a difference signal of stereo audio signals to be 
reproduced by said respective loudspeakers and respective 
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echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the sum signal and the 
difference signal of the respective microphone collected 
audio signals, thereby to update filter characteristics of 
5 said first to fourth filter means to values that cancel said 
estimated errors, respectively. 
[0045] 

The stereo echo canceller of this invention may be 
further provided with correlation detecting means for 
10 detecting the correlation between the sum signal and the 

difference signal of said stereo audio signals and, when a 
value of said correlation is no less than a prescribed value, 
stopping updating of said filter characteristics, thereby to 
prevent the echo cancel error signals from unexpectedly 
15 increasing. 

[0046] 

A stereo sound transfer apparatus of this invention 
is such that, with respect to two spaces each forming said 
four audio transfer systems, any of said stereo echo 
20 cancellers is arranged in each space, so that the stereo 

audio signals, which have been echo-canceled by said stereo 
echo cancellers, are transmitted between said two spaces. 
[0047] 

[Inventions of Claims 32 to 35 and Inventions 
25 relating to such Inventions] 

A multi-channel echo cancel method of this invention 
is a method wherein, with respect to a space provided therein 
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with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 
in which mult i -channel sounds reproduced by said respective 
loudspeakers and having a correlation with each other are 
5 collected by said microphones, composite transfer functions 
of said plurality of audio transfer systems are estimated so 
as to set corresponding filter characteristics, respectively, 
echo cancel signals are respectively produced by giving said 
set filter characteristics to composite signals of individual 

10 signals to be reproduced by said respective loudspeakers, and 
said echo cancel signals are subtracted from composite 
signals of individual collected audio signals of said one or 
plurality of microphones, thereby performing echo 
cancellation, and wherein, using as reference signals a set 

15 of a plurality of low-correlation composite signals which 
correspond to signals obtained by suitably combining said 
multi-channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals (e.g. suitably combining said multi- 

20 channel audio signals to produce a plurality of low- 
correlation composite signals having a lower correlation with 
each other than that between said mult i -channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 

25 a set of a plurality of low-correlation composite signals 
which correspond to signals obtained by suitably combining 
said multi- channel audio signals and which have a lower 
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correlation with each other than that between said multi- 
channel audio signals and using the set of said plurality of 
low-correlation composite signals as reference signals, or 
the like) , composite transfer functions of said plurality of 
5 audio transfer systems are respectively derived, thereby to 
set corresponding filter characteristics. According to this 
invention, using as reference signals a set of a plurality of 
low-correlation composite signals which correspond to signals 
obtained by suitably combining multi-channel audio signals 

10 having a correlation therebetween and which have a lower 
correlation with each other than that between such multi- 
channel audio signals, the composite transfer functions of 
said plurality of audio transfer systems are respectively 
derived, and corresponding filter characteristics are set, 

15 thereby to enable echo cancellation. In accordance therewith, 
since the multi-channel audio signals can be reproduced from 
the loudspeakers with no or less processing, which induces 
deterioration, applied to the multi-channel audio signals, 
excellent reproduced tone quality can be achieved. Further, 

20 there is no or only a small delay in reproduced signals. 

Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. The calculation 
of respectively deriving the composite transfer functions of 
said plurality of audio transfer systems using as the 

25 reference signals the set of the plurality of low-correlation 
composite signals, may be, for example, a calculation of 
respectively deriving the composite transfer functions of 



- 82 - 



said plurality of audio transfer systems based on a cross- 
spectrum calculation between the plurality of low-correlation 
composite signals and the composite signals of the individual 
collected audio signals of the respective microphones. 
5 Further, the calculation of respectively deriving the 

composite transfer functions of said plurality of audio 
transfer systems based on said cross -spectrum calculation, 
may be, for example, a calculation of combining said multi- 
channel audio signals through addition or subtraction to 

10 produce a plurality of low-correlation composite signals 

having a lower correlation with each other than that between 
said mult i -channel audio signals, deriving cross spectra 
between said plurality of low-correlation composite signals 
and the composite signals of the individual collected audio 

15 signals of the respective microphones, and ensemble-averaging 
them in a predetermined time period per cross spectrum to 
derive composite transfer functions of said plurality of 
audio transfer systems. 
[0048] 

20 A mult i -channel echo cancel method of this invention 

is a method wherein, with respect to a space provided therein 
with a plurality of loudspeakers and one or a plurality of 
microphones and forming a plurality of audio transfer systems 
in which multi-channel sounds reproduced by said respective 

2 5 loudspeakers and having a correlation with each other are 

collected by said microphones, composite transfer functions 
of said plurality of audio transfer systems are estimated so 
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as to set corresponding filter characteristics, respectively, 
echo cancel signals are respectively produced by giving said 
set filter characteristics to composite signals of individual 
signals to be reproduced by said respective loudspeakers, and 
5 said echo cancel signals are subtracted from composite 

signals of individual collected audio signals of said one or 
plurality of microphones, thereby performing echo 
cancellation, and wherein, using as reference signals a set 
of a plurality of low-correlation composite signals which 

10 correspond to signals obtained by suitably combining said 
multi- channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals (e.g. suitably combining said multi- 
channel audio signals to produce a plurality of low- 

15 correlation composite signals having a lower correlation with 
each other than that between said multi -channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 
a set of a plurality of low-correlation composite signals 

2 0 which correspond to signals obtained by suitably combining 
said multi -channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals and using the set of said plurality of 
low-correlation composite signals as reference signals, or 

25 the like) , estimated errors of composite transfer functions 
of said plurality of audio transfer systems are respectively 
derived, thereby to update corresponding filter 
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characteristics to values that cancel said estimated errors. 
According to this invention, using as reference signals a set 
of a plurality of low-correlation composite signals which 
correspond to signals obtained by suitably combining multi- 
5 channel audio signals having a correlation therebetween and 
which have a lower correlation with each other than that 
between such multi-channel audio signals, the estimated 
errors of the composite transfer functions of said plurality 
of audio transfer systems are respectively derived so as to 

10 successively update the corresponding filter characteristics 
to the values that cancel said estimated errors, thereby to 
enable echo cancellation. In accordance therewith, since the 
multi-channel audio signals can be reproduced from the 
loudspeakers with no or less processing, which induces 

15 deterioration, applied to the multi-channel audio signals, 

excellent reproduced tone quality can be achieved. Further, 
there is no or only a small delay in reproduced signals. 
Thus, when applying to the teleconferencing system or the 
like, natural conversation can be conducted. It is also 

20 possible to update the filter characteristics in real time. 

The calculation of respectively deriving the estimated errors 
of the composite transfer functions of said plurality of 
audio transfer systems using the set of the plurality of low- 
correlation composite signals as the reference signals, may 

25 be, for example, a calculation of respectively deriving the 
estimated errors of the composite transfer functions of said 
plurality of audio transfer systems based on a cross-spectrum 
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calculation between said plurality of low-correlation 
composite signals and echo cancel error signals obtained by 
subtracting the corresponding echo cancel signals from the 
composite signals of the individual collected audio signals 
5 of said one or plurality of microphones. Further, the 

calculation of respectively deriving the estimated errors of 
the composite transfer functions of said plurality of audio 
transfer systems based on said cross -spectrum calculation, 
may be, for example, a calculation of combining said multi- 

10 channel audio signals through addition or subtraction to 
produce a plurality of low-correlation composite signals 
having a lower correlation with each other than that between 
said mult i -channel audio signals, deriving cross spectra 
between said plurality of low-correlation composite signals 

15 and the echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the composite signals 
of the individual collected audio signals of said one or 
plurality of microphones, and ensemble -averaging them in a 
predetermined time period per cross spectrum to derive 

20 estimated errors of the composite transfer functions of said 
plurality of audio transfer systems. Further, the 
correlation between said plurality of low-correlation 
composite signals is detected and, when a value of said 
correlation is no less than a prescribed value, updating of 

25 said filter characteristics is stopped, thereby to prevent 
the echo cancel error signals from unexpectedly increasing. 
[0049] 
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A multi -channel sound transfer method of this 
invention is such that, with respect to two spaces each 
forming said plurality of audio transfer systems, any of the 
foregoing multi -channel echo cancel methods is carried out, 
5 so that the multi-channel audio signals, which have been 
echo-canceled by performing said method, are transmitted 
between said two spaces. In accordance therewith, the multi- 
channel audio transmission with reduced echo cancellation can 
be performed between two spots, which, for example, can be 
10 applied to the teleconferencing system or the like. 
[0050] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 

15 or four audio transfer systems in which stereo sounds 

reproduced by said respective loudspeakers are collected by 
said microphones, composite transfer functions of said two or 
four audio transfer systems are estimated so as to set 
corresponding filter characteristics, respectively, echo 

20 cancel signals are respectively produced by giving said set 
filter characteristics to composite signals of individual 
signals to be reproduced by said respective loudspeakers, and 
said echo cancel signals are subtracted from composite 
signals of individual collected audio signals of said one or 

25 two microphones, thereby performing echo cancellation, and 
wherein, using a sum signal and a difference signal of said 
stereo audio signals as reference signals, composite transfer 
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functions of said two or four audio transfer systems are 
respectively derived, thereby to set corresponding filter 
characteristics. According to this invention, since the sum 
signal and the difference signal of the stereo audio signals 
5 have a low correlation therebetween, estimated errors of the 
composite transfer functions of the two or four audio 
transfer systems are respectively derived using the sum 
signal and the difference signal as reference signals, so as 
to successively update the corresponding filter 

10 characteristics to values that cancel said estimated errors, 

thereby to enable echo cancellation. In accordance therewith, 
since the stereo signals can be reproduced from the 
loudspeakers with no or less processing, which induces 
deterioration, applied to the stereo signals, excellent 

15 reproduced tone quality can be achieved. Further, there is 
no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. The calculation of 
respectively deriving the composite transfer functions of 

20 said two or four audio transfer systems using the sum signal 
and the difference signal of said stereo audio signals as the 
reference signals, may be, for example, a calculation of 
respectively deriving the composite transfer functions of 
said two or four audio transfer systems based on a cross - 

25 spectrum calculation between the sum signal and the 
difference signal, and the composite signals of the 
individual collected audio signals of the respective 
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microphones. Further, the calculation of respectively 
deriving the composite transfer functions of said two or four 
audio transfer systems based on said cross -spectrum 
calculation, may be, for example, a calculation of deriving 
5 cross spectra between the sum signal and the difference 

signal of said stereo audio signals and the composite signals 
of the individual collected audio signals of the respective 
microphones, and ensemble -averaging them in a predetermined 
time period per cross spectrum to derive composite transfer 
10 functions of said two or four audio transfer systems. 
[0051] 

A stereo echo cancel method of this invention is a 
method wherein, with respect to a space provided therein with 
two loudspeakers and one or two microphones and forming two 

15 or four audio transfer systems in which stereo sounds 

reproduced by said respective loudspeakers are collected by 
said microphones, composite transfer functions of said two or 
four audio transfer systems are estimated so as to set 
corresponding filter characteristics, respectively, echo 

20 cancel signals are respectively produced by giving said set 
filter characteristics to composite signals of individual 
signals to be reproduced by said respective loudspeakers, and 
said echo cancel signals are subtracted from composite 
signals of individual collected audio signals of said one or 

25 two microphones, thereby performing echo cancellation, and 
wherein, using a sum signal and a difference signal of said 
stereo audio signals as reference signals, estimated errors 
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of composite transfer functions of said two or four audio 
transfer systems are respectively derived, thereby to update 
corresponding filter characteristics to values that cancel 
said estimated errors. According to this invention, using as 
5 reference signals a set of a plurality of low-correlation 
composite signals which correspond to signals obtained by 
suitably combining multi- channel audio signals having a 
correlation therebetween and which have a lower correlation 
with each other than that between such multi -channel audio 

10 signals, the estimated errors of the composite transfer 

functions of said plurality of audio transfer systems are 
respectively derived, so as to successively update the 
corresponding filter characteristics to the values that 
cancel the estimated errors, thereby to enable echo 

15 cancellation. In accordance therewith, since the multi- 
channel audio signals can be reproduced from the loudspeakers 
with no or less processing, which induces deterioration, 
applied to the multi -channel audio signals, excellent 
reproduced tone quality can be achieved. Further, there is 

20 no or only a small delay in reproduced signals. Thus, when 
applying to the teleconferencing system or the like, natural 
conversation can be conducted. It is also possible to update 
the filter characteristics in real time. The calculation of 
respectively deriving the estimated errors of the composite 

25 transfer functions of said two or four audio transfer systems 
using the sum signal and the difference signal of said stereo 
audio signals as the reference signals, may be, for example, 
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a calculation of respectively deriving the estimated errors 
of the composite transfer functions of said two or four audio 
transfer systems based on a cross-spectrum calculation 
between the sum signal and the difference signal of said 
5 stereo audio signals and respective echo cancel error signals 
obtained by subtracting the corresponding echo cancel signals 
from the composite signals of the individual collected audio 
signals of said one or two microphones. Further, the 
calculation of respectively deriving the estimated errors of 

10 the composite transfer functions of said two or four audio 
transfer systems based on the cross-spectrum calculation 
between the sum signal and the difference signal of said 
stereo audio signals and said echo cancel error signals, may 
be, for example, a calculation of deriving cross spectra 

15 between the sum signal and the difference signal of said 

stereo audio signals and said echo cancel error signals, and 
ensemble -averaging them in a predetermined time period per 
cross spectrum to derive estimated errors of the composite 
transfer functions of said two or four audio transfer systems. 

20 Further, the correlation between the sum signal and the 

difference signal of said stereo audio signals is detected 
and, when a value of said correlation is no less than a 
prescribed value, updating of said filter characteristics is 
stopped, thereby to prevent the echo cancel error signals 

25 from unexpectedly increasing. 
[0052] 

A stereo audio transmission method of this invention 
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is such that, with respect to two spaces each forming said 
four audio transfer systems, any of the foregoing multi- 
channel echo cancel methods is carried out, so that the 
stereo audio signals, which have been echo-canceled by 
5 performing said method, are transmitted between said two 
spaces. In accordance therewith, the stereo audio 
transmission with reduced echo cancellation can be performed 
between two spots, which, for example, can be applied to the 
teleconferencing system or the like. 

10 [0053] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 
provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 

15 sounds reproduced by said respective loudspeakers are 

collected by said respective microphones, a sum signal of 
stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 
first and second filter means, respectively, so as to produce 

20 first and second echo cancel signals, a difference signal of 
the stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 
third and fourth filter means, respectively, so as to produce 
third and fourth echo cancel signals, echo cancellation is 

25 performed by subtracting, using first subtracting means, said 
first and third echo cancel signals from a sum signal of 
collected audio signals of the respective microphones, and 
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echo cancellation is performed by subtracting, using second 
subtracting means, said second and fourth echo cancel signals 
from a difference signal of the collected audio signals of 
the respective microphones, said stereo echo canceller 
5 comprising: transfer function calculating means for 

respectively deriving filter characteristics corresponding to 
composite transfer functions of said four audio transfer 
systems based on a cross-spectrum calculation between the sum 
signal and the difference signal of the stereo audio signals 

10 to be reproduced by said respective loudspeakers and the sum 
signal and the difference signal of the respective microphone 
collected audio signals, thereby to set said derived filter 
characteristics to corresponding ones of said first to fourth 
filter means, respectively. 

15 [0054] 

A stereo echo canceller of this invention is a 
stereo echo canceller wherein, with respect to a space 
provided therein with two loudspeakers and two microphones 
and forming four audio transfer systems in which stereo 

20 sounds reproduced by said respective loudspeakers are 

collected by said respective microphones, a sum signal of 
stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 
first and second filter means, respectively, so as to produce 

25 first and second echo cancel signals, a difference signal of 
the stereo audio signals to be reproduced by said respective 
loudspeakers is subjected to convolution calculations by 
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third and fourth filter means, respectively, so as to produce 
third and fourth echo cancel signals, echo cancellation is 
performed by subtracting, using first subtracting means, said 
first and third echo cancel signals from a sum signal of 
5 collected audio signals of the respective microphones, and 
echo cancellation is performed by subtracting, using second 
subtracting means, said second and fourth echo cancel signals 
from a difference signal of the collected audio signals of 
the respective microphones, said stereo echo canceller 

10 comprising: transfer function calculating means for 

respectively deriving estimated errors of composite transfer 
functions of said four audio transfer systems based on a 
cross-spectrum calculation between the sum signal and the 
difference signal of the stereo audio signals to be 

15 reproduced by said respective loudspeakers and respective 
echo cancel error signals obtained by subtracting the 
corresponding echo cancel signals from the sum signal and the 
difference signal of the respective microphone collected 
audio signals, thereby to update filter characteristics of 

20 said first to fourth filter means to values that cancel said 
estimated errors, respectively. 
[0055] 

The stereo echo canceller of this invention may be 
further provided with correlation detecting means for 
25 detecting the correlation between the sum signal and the 

difference signal of said stereo audio signals and, when a 
value of said correlation is no less than a prescribed value, 
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stopping updating of said filter characteristics, thereby to 
prevent the echo cancel error signals from unexpectedly 
increasing. 

[0056] 

A stereo sound transfer apparatus of this invention 
is such that, with respect to two spaces each forming said 
four audio transfer systems, any of said stereo echo 
cancellers is arranged in each space, so that the stereo 
audio signals, which have been echo-canceled by said stereo 
echo cancellers, are transmitted between said two spaces. 
[0057] 

[Inventions of Claims 36 to 40 and Inventions 
relating to such Inventions] 

A transfer function calculation apparatus of this 
invention is a transfer function calculation apparatus which, 
with respect to a space provided therein with a plurality of 
loudspeakers and one or a plurality of microphones and 
forming a plurality of audio transfer systems in which multi- 
channel sounds inputted from an outside and reproduced by 
said respective loudspeakers and having a correlation with 
each other are collected by said microphones, estimates 
individual transfer functions of said plurality of audio 
transfer systems or a plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions, wherein, using as reference signals a set 
of a plurality of low-correlation composite signals which 
correspond to signals obtained by suitably combining said 
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multi- channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals (e.g. suitably combining said multi- 
channel audio signals to produce a plurality of low- 
5 correlation composite signals having a lower correlation with 
each other than that between said multi-channel audio signals 
and using a set of said plurality of low-correlation 
composite signals as reference signals, or directly inputting 
a set of a plurality of low-correlation composite signals 

10 which correspond to signals obtained by suitably combining 
said multi-channel audio signals and which have a lower 
correlation with each other than that between said multi- 
channel audio signals and using the set of said plurality of 
low-correlation composite signals as reference signals, or 

15 the like) , individual transfer functions of the respective 
audio transfer systems or a plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions are estimated. The calculation of 
respectively deriving the individual transfer functions of 

20 the respective audio transfer systems or the plurality of 

composite transfer functions obtained by suitably combining 
said individual transfer functions, using as the reference 
signals the set of the plurality of low-correlation composite 
signals, may be, for example, a calculation of respectively 

25 deriving the individual transfer functions of the respective 
audio transfer systems or the plurality of composite transfer 
functions obtained by suitably combining said individual 
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transfer functions, based on a cross-spectrum calculation 
between the plurality of low-correlation composite signals 
and the individual collected audio signals of the microphones, 
or the plurality of composite signals obtained by suitably 
5 combining said individual collected audio signals. Further, 
the calculation of respectively deriving the individual 
transfer functions of said plurality of audio transfer 
systems or the plurality of composite transfer functions 
obtained by suitably combining said individual transfer 

10 functions, based on said cross-spectrum calculation, may be, 
for example, a calculation of respectively deriving the 
individual transfer functions of said plurality of audio 
transfer systems or the plurality of composite transfer 
functions obtained by suitably combining said individual 

15 transfer functions, by combining said multi-channel audio 
signals through addition or subtraction to produce a 
plurality of low-correlation composite signals having a lower 
correlation with each other than that between said multi- 
channel audio signals, deriving cross spectra between said 

2 0 plurality of low-correlation composite signals and the 

individual collected audio signals of the microphones, or the 
plurality of composite signals obtained by suitably combining 
said individual collected audio signals, and ensemble- 
averaging them in a predetermined time period per cross 

25 spectrum. The calculation of respectively deriving the 
individual transfer functions of said plurality of audio 
transfer systems based on said cross -spectrum calculation may 



- 97 - 



also be a calculation of respectively deriving the individual 
transfer functions of said plurality of audio transfer 
systems by producing a plurality of mutually orthogonal 
uncorrelated composite signals by applying a principal 
5 component analysis to said mult i -channel audio signals, 

deriving cross spectra between said plurality of uncorrelated 
composite signals and the individual collected audio signals 
of the microphones, and ensemble-averaging them in a 
predetermined time period per cross spectrum. 

10 [0058] 

A transfer function calculation apparatus of this 
invention is a transfer function calculation apparatus which, 
with respect to a space provided therein with two 
loudspeakers and two microphones and forming four audio 

15 transfer systems in which stereo sounds reproduced by said 
respective loudspeakers are collected by said respective 
microphones, estimates individual transfer functions of said 
four audio transfer systems or a plurality of composite 
transfer functions obtained by suitably combining said 

20 individual transfer functions, wherein, using a sum signal 
and a difference signal of said stereo audio signals as 
reference signals, individual transfer functions of said four 
audio transfer systems or a plurality of composite transfer 
functions obtained by suitably combining said individual 

25 transfer functions are estimated. The calculation of 

respectively deriving the individual transfer functions of 
said four audio transfer systems or the plurality of 
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composite transfer functions obtained by suitably combining 
said individual transfer functions, using the sum signal and 
the difference signal of said stereo audio signals as the 
reference signals, may be, for example, a calculation of 
5 respectively deriving the individual transfer functions of 
said four audio transfer systems or the plurality of 
composite transfer functions obtained by suitably combining 
said individual transfer functions, based on a cross-spectrum 
calculation between said sum signal and said difference 

10 signal, and individual collected audio signals of the 

microphones, or a plurality of composite signals obtained by 
suitably combining said individual collected audio signals. 
Further, the calculation of respectively deriving the 
individual transfer functions of said four audio transfer 

15 systems or the plurality of composite transfer functions 
obtained by suitably combining said individual transfer 
functions, based on said cross-spectrum calculation, may be a 
calculation of deriving cross spectra between the sum signal 
and the difference signal of said stereo audio signals and 

20 the individual collected audio signals of the microphones, or 
the plurality of composite signals obtained by suitably 
combining said individual collected audio signals, and 
ensemble -averaging them in a predetermined time period per 
cross spectrum, thereby to respectively derive the individual 

25 transfer functions of said four audio transfer systems or the 
plurality of composite transfer functions obtained by 
suitably combining said individual transfer functions. 
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[0059] 

A transfer function calculation apparatus of this 
invention is a transfer function calculation apparatus which, 
with respect to a space provided therein with two 
5 loudspeakers and two microphones and forming four audio 

transfer systems in which stereo sounds reproduced by said 
respective loudspeakers are collected by said respective 
microphones, estimates individual transfer functions of said 
four audio transfer systems, wherein mutually orthogonal two 

10 uncorrelated composite signals are produced by applying a 
principal component analysis to said stereo audio signals, 
and individual transfer functions of said four audio transfer 
systems are estimated using a set of said two uncorrelated 
composite signals as reference signals. The calculation of 

15 respectively deriving the individual transfer functions of 

said four audio transfer systems using said two uncorrelated 
composite signals as the reference signals, may be, for 
example, a calculation of respectively deriving the 
individual transfer functions of said four audio transfer 

20 systems based on a cross-spectrum calculation between said 
two uncorrelated composite signals and the individual 
collected audio signals of the microphones. Further, the 
calculation of respectively deriving the individual transfer 
functions of said four audio transfer systems based on said 

25 cross-spectrum calculation, may be, for example, a 

calculation of deriving cross spectra between said two 
uncorrelated composite signals and the individual collected 
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audio signals of the microphones , and ensemble -averaging them 
in a predetermined time period per cross spectrum, thereby to 
respectively derive the individual transfer functions of said 
four audio transfer systems. In this case, by making 
5 relatively longer a time period of performing said ensemble 
averaging when double talk is detected where sounds other 
than those reproduced by said loudspeakers are inputted into 
said microphones, while making it relatively shorter when the 
double talk is not detected, it is possible to fully converge 
10 the estimated errors when the double- talk exists, and further, 
quicken the convergence of the estimated errors when there is 
no double talk . 

[0060] 

[Inventions of Claims 41 to 51 and Inventions 

15 relating to such Inventions] 

An inventive echo cancel method is associated to a 
space provided therein with a plurality of loudspeakers and 
one or a plurality of microphones for forming a plurality of 
audio transfer systems through which audio signals of multi- 

20 channels having a correlation with each other are reproduced 
by said respective loudspeakers and are collected by said 
microphones, and designed for performing an echo cancellation 
by subtracting an echo cancel signal from the audio signals 
collected by the respective microphone or from composite 

25 signals obtained by combining the collected audio signals. 

The inventive method comprises inputting a plurality of low- 
correlation audio signals which are obtained by suitably 
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combining first audio signals of multi-channels and which 
have a lower correlation with each other than that among said 
first audio signals of multi -channels , generating second 
audio signals of multi-channels having a correlation with 
5 each other by computation based on the inputted low- 
correlation audio signals, feeding the generated second audio 
signals to the respective loudspeakers so as to reproduce 
audio sounds, feeding the generated second audio signals or 
the inputted low-correlation audio signals to filters, 

10 estimating individual transfer functions of said plurality of 
said audio transfer systems or a plurality of composite 
transfer functions obtained by suitably combining said 
individual transfer functions based on the inputted low- 
correlation audio signals so as to set corresponding filter 

15 characteristics, producing echo cancel signals by applying 
said set filter characteristics to the second audio signals 
or the low-correlation audio signals fed to the filters, and 
subtracting said echo cancel signals from collected audio 
signals obtained by collecting the reproduced audio sounds by 

2 0 the microphones or from composite audio signals obtained by 
suitably combining said collected audio signals, thereby 
performing the echo cancellation. 

Preferably, in the inventive echo cancel method, the 
inputted low-correlation audio signals are obtained by adding 

25 or subtracting the first audio signals of mult i -channels with 
each other. 

Another inventive echo cancel method is associated 
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to a space provided therein with a plurality of loudspeakers 
and one or a plurality of microphones for forming a plurality 
of audio transfer systems through which audio signals of 
multi -channels having a correlation with each other are 
5 reproduced by said respective loudspeakers and are collected 
by said microphones, and designed for performing an echo 
cancellation by subtracting an echo cancel signal from the 
audio signals collected by the respective microphone or from 
composite signals obtained by combining the collected audio 

10 signals. The inventive method comprises inputting a 

plurality of first low-correlation audio signals which are 
obtained by suitably combining first audio signals of multi- 
channels and which have a lower correlation with each other 
than that among said first audio signals of multi -channels, 

15 generating second audio signals of multi -channels having a 
correlation with each other by computation based on the 
inputted first low-correlation audio signals, feeding the 
generated second audio signals to the respective loudspeakers 
so as to reproduce audio sounds, generating second low- 

20 correlation audio signals of multi -channels based on the 

generated second audio signals, feeding the generated second 
audio signals or the generated second low-correlation audio 
signals to filters, estimating individual transfer functions 
of said plurality of said audio transfer systems or a 

25 plurality of composite transfer functions obtained by 

suitably combining said individual transfer functions based 
on the generated second low-correlation audio signals so as 
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to set corresponding filter characteristics, producing echo 
cancel signals by applying said set filter characteristics to 
the second audio signals or the second low-correlation audio 
signals fed to the filters, and subtracting said echo cancel 
5 signals from collected audio signals obtained by collecting 
the reproduced audio sounds at the microphones or from 
composite audio signals obtained by suitably combining said 
collected audio signals, thereby performing the echo 
cancellation . 

10 Preferably, in the inventive echo cancel method, the 

inputted first low-correlation audio signals are obtained by 
adding or subtracting the first audio signals of multi- 
channels with each other. 

An inventive echo canceller is associated to a space 

15 provided therein with a plurality of loudspeakers and one or 
a plurality of microphones for forming a plurality of audio 
transfer systems through which audio signals of multi- 
channels having a correlation with each other are reproduced 
by said respective loudspeakers and are collected by said 

20 microphones, and designed for performing an echo cancellation 
by subtracting an echo cancel signal from the audio signals 
collected by the respective microphone or from composite 
signals obtained by combining the collected audio signals . 
The inventive echo canceller comprises an inputting means for 

25 inputting a plurality of low-correlation audio signals which 
are obtained by suitably combining first audio signals of 
mult i -channels and which have a lower correlation with each 
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other than that among said first audio signals of multi- 
channels, a demodulating means provided for generating second 
audio signals of multi-channels having a correlation with 
each other by demodulating the inputted low-correlation audio 
5 signals, and for feeding the generated second audio signals 
to the respective loudspeakers so as to reproduce audio 
sounds, an estimating means for estimating individual 
transfer functions of said plurality of said audio transfer 
systems or a plurality of composite transfer functions 

10 obtained by suitably combining said individual transfer 

functions based on the inputted low-correlation audio signals 
so as to set corresponding filter characteristics, a filter 
means for producing echo cancel signals by applying said set 
filter characteristics to the second audio signals or the 

15 low-correlation audio signals fed to the filter means, and a 
subtracting means for subtracting said echo cancel signals 
from collected audio signals obtained by collecting the 
reproduced audio sounds at the microphones or from composite 
audio signals obtained by suitably combining said collected 

20 audio signals, thereby performing the echo cancellation. 

Preferably, in the inventive echo canceller, the 
inputted low-correlation audio signals are obtained by adding 
or subtracting the first audio signals of multi-channels with 
each other. 

25 Another inventive echo canceller is associated to a 

space provided therein with a plurality of loudspeakers and 
one or a plurality of microphones for forming a plurality of 
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audio transfer systems through which audio signals of multi- 
channels having a correlation with each other are reproduced 
by said respective loudspeakers and are collected by said 
microphones, and designed for performing an echo cancellation 
5 by subtracting an echo cancel signal from the audio signals 
collected by the respective microphone or from composite 
signals obtained by combining the collected audio signals. 
The inventive echo canceller comprises an inputting means for 
inputting a plurality of first low-correlation audio signals 

10 which are obtained by suitably combining first audio signals 
of mult i -channels and which have a lower correlation with 
each other than that among said first audio signals of multi- 
channels, a demodulating means provided for generating second 
audio signals of multi-channels having a correlation with 

15 each other by demodulating the inputted first low-correlation 
audio signals, and for feeding the generated second audio 
signals to the respective loudspeakers so as to reproduce 
audio sounds, an estimating means provided for generating 
second low-correlation audio signals of multi-channels based 

20 on the generated second audio signals, and for estimating 

individual transfer functions of said plurality of said audio 
transfer systems or a plurality of composite transfer 
functions obtained by suitably combining said individual 
transfer functions based on the generated second low- 

25 correlation audio signals so as to set corresponding filter 
characteristics, a filter means for producing echo cancel 
signals by applying said set filter characteristics to the 
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second audio signals or the second low-correlation audio 
signals fed to the filter means, and a subtracting means for 
subtracting said echo cancel signals from collected audio 
signals obtained by collecting the reproduced audio sounds at 
5 the microphones or from composite audio signals obtained by 
suitably combining said collected audio signals, thereby 
performing the echo cancellation. 

Preferably, in the inventive echo canceller, the 
inputted first low-correlation audio signals are obtained by 

10 adding or subtracting the first audio signals of multi- 
channels with each other. 

Preferably, in a inventive multi -channel echo 
canceller, the multi-channel audio signals being inputted 
from an outside and having a correlation with each other are 

15 reproduced by said respective loudspeakers without lowering 
the correlation of the inputted multi-channel audio signals. 

Preferably, the multi-channel audio signals being 
inputted from an outside and having a correlation with each 
other are provisionally modulated to lower the correlation, 

20 then demodulated to restore the correlation, and thereafter 
reproduced by said respective loudspeakers. Further, the 
multi-channel audio signals are provisionally modulated to 
lower the correlation by adding and subtracting the multi- 
channel audio signals with each other, or by orthogonal! zing 

25 the multi-channel audio signals with each other. 

In this invention, "audio signal" is not limited to 
human voices, but covers all acoustic signals in the audible 
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frequency band. 

Brief Description of the Drawings 

Fig . 1 is a block diagram showing a structural 
5 example in a stereo echo canceller 16, 24 of Fig. 2. 

Fig. 2 is a block diagram showing an embodiment of a 
stereo sound transfer apparatus of this invention. 

Fig. 3 is a diagram showing the simulation 
measurement results of the echo cancel performance of the 
10 stereo echo canceller 16, 24 of Fig. 1. 

Fig. 4 is a diagram showing the simulation 
measurement results of the echo cancel performance of the 
stereo echo canceller 16, 24 of Fig. 1. 

Fig. 5 is a diagram showing the simulation 
15 measurement results of the echo cancel performance of the 
stereo echo canceller 16, 24 of Fig. 1. 

Fig. 6 is a diagram showing the simulation 
measurement results of the echo cancel performance of the 
stereo echo canceller 16, 24 of Fig. 1. 
2 0 Fig. 7 is a diagram showing the simulation 

measurement results of the echo cancel performance of the 
stereo echo canceller 16, 24 of Fig. 1. 

Fig. 8 is a diagram showing the simulation 
measurement results of the echo cancel performance of the 
25 stereo echo canceller 16, 24 of Fig. 1. 

Fig. 9 is a block diagram showing another structural 
example in the stereo echo canceller 16, 24 of Fig. 2. 
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Fig. 10 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 11 is a block diagram showing another 
5 structural example in the stereo echo canceller 16, 24 of Fig. 
2. 

Fig. 12 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

10 Fig. 13 is a block diagram showing another 

structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 14 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
15 2 . 

Fig. 15 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 16 is a block diagram showing another 
20 structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 17 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

2 5 Fig. 18 is a block diagram showing another 

structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 
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Fig. 19 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 2 0 is a block diagram showing another 
5 structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 21 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

10 Fig. 22 is a block diagram showing another 

structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 23 is a block diagram showing another 
structural example in the stereo echo canceller 16, 24 of Fig. 
15 2 . 

Fig. 24 is a block diagram showing a modification of 
the structures of Figs. 1, 9 to 11. 

Fig. 25 is a block diagram showing a modification of 
the structures of Figs. 12 to 15. 
20 Fig. 26 is a block diagram showing a modification of 

the structures of Figs. 16 to 19. 

Fig. 27 is a block diagram showing a modification of 
the structures of Figs . 20 and 21 . 

Fig. 2 8 is a block diagram showing another 
25 structural example in the stereo echo canceller 16, 24 of Fig. 
2 . 

Fig. 29 is a time chart showing an example of unit 
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intervals for orthogonalization processing and deriving an 
impulse response or its estimated error in the stereo echo 
canceller of Fig. 28. 

Fig. 3 0 is a diagram showing one example of 
5 functional blocks of an orthogonal i zing filter 500 of Fig. 28. 

Fig. 31 is a diagram showing one example of 
functional blocks of transfer function calculating means 502 
of Fig. 28. 

Fig. 32 is a functional block diagram showing a 
10 modification of the transfer function calculating means 502 
of Fig. 31. 

Fig. 33 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
echo cancellation amount of the stereo echo canceller 16, 24 

15 of Fig. 28 when there is no double talk. 

Fig. 34 is a diagram showing the simulation 
measurement result with respect to a time-domain variation, in 
echo cancellation amount of the stereo echo canceller 16, 24 
of Fig. 28 when there is no double talk. 

20 Fig. 35 is a diagram showing the simulation 

measurement result with respect to a time-domain variation in 
echo cancellation amount of the stereo echo canceller 16, 24 
of Fig. 2 8 when there is no double talk. 

Fig. 3 6 is a diagram showing the simulation 

25 measurement result with respect to a time-domain variation in 
echo cancellation amount of the stereo echo canceller 16, 24 
of Fig. 28 when there is no double talk. 
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Fig. 37 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
echo cancellation amount of the stereo echo canceller 16, 24 
of Fig. 2 8 when there is double talk. 
5 Fig. 38 is a diagram showing the simulation 

measurement result with respect to a time-domain variation in 
echo cancellation amount of the stereo echo canceller 16 , 24 
of Fig. 28 when there is double talk. 

Fig. 3 9 is a diagram showing the simulation 
10 measurement result with respect to a time-domain variation in 
echo cancellation amount of the stereo echo canceller 16, 24 
of Fig. 2 8 when there is double talk. 

Fig. 40 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
15 echo cancellation amount of the stereo echo canceller 16, 24 
of Fig. 2 8 when there is double talk. 

Fig. 41 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
20 canceller 16, 24 of Fig. 28 when there is no double talk. 

Fig. 42 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
canceller 16, 24 of Fig. 28 when there is no double talk. 
25 Fig. 43 is a diagram showing the simulation 

measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
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canceller 16, 24 of Fig. 28 when there is no double talk. 

Fig. 44 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
canceller 16, 24 of Fig. 28 when there is no double talk. 

Fig. 4 5 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
canceller 16, 24 of Fig. 28 when there is double talk. 

Fig. 46 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
canceller 16, 24 of Fig. 28 when there is double talk. 

Fig. 4 7 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
canceller 16, 24 of Fig. 28 when there is double talk. 

Fig. 48 is a diagram showing the simulation 
measurement result with respect to a time-domain variation in 
transfer function estimated error of the stereo echo 
canceller 16, 24 of Fig. 28 when there is double talk. 

Fig. 4 9 is an exemplary diagram for explaining an 
ensemble averaging process according to overlap processing. 

Fig. 50 is a block diagram showing a modification of 
the stereo echo canceller 16, 24 of Fig. 28. 

Fig. 51 is a block diagram showing a structural 
example wherein the number of microphones is modified to one 
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in Fig. 1. 

Best Mode for Carrying Out the Invention 
[0061] 

5 Embodiments of the present invention will be 

described hereinbelow. Fig. 2 shows the whole structure of a 
two-way stereo sound transfer apparatus according to this 
invention. This is for performing two-way stereo 
transmission between a spot A and a spot B and, for example, 

10 is applicable to the teleconferencing system. In the spot A, 
two loudspeakers SP-A(L) and SP-A(R) and two microphones MC- 
A(L) and MC-A(R) are arranged in one space. Collected audio 
signals of the microphones MC-A(L) and MC-A(R) are converted 
into digital signals at A/D converters 12 and 14, 

15 respectively, and applied with echo cancel processing at a 
stereo echo canceller 16 , then modulated at a CODEC (CODER 
and DECODER) 18 and transmitted to the spot B via a wire or 
radio transmission line 20. In the spot B, two loudspeakers 
SP-B(L) and SP-B (R) and two microphones MC-B(L) and MC-B(R) 

20 are arranged in one space. Incidentally, assuming a 

situation that participants in the spots A and B talk with 
each other in a face- to- face manner in the teleconferencing 
system, loudspeakers and microphones are arranged such that a 
sound collected by a microphone on the participant's left in 

2 5 one spot is reproduced from a loudspeaker on the 

participant's right in the other spot whereas a sound 
collected by a microphone on the participant's right in one 
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spot is reproduced from a loudspeaker on the participants 
left in the other spot. Specifically, when the loudspeaker 
SP-A(L) and the microphone MC-A(L) are arranged on the 
participant's left and the loudspeaker SP-A(R) and the 
5 microphone MC-A(R) are arranged on the participant's right in 
the spot A # the loudspeaker SP-B(L) and the microphone MC- 
B(L) are arranged on the participant's right and the 
loudspeaker SP-B(R) and the microphone MC-B (R) are arranged 
on the participant's left in the spot B. 

10 [0062] 

The signals transmitted from the spot A are inputted 
into a CODEC 22 where the collected audio signals of the 
microphones MC-A(L) and MC-A(R) are demodulated. These 
demodulated collected audio signals of the microphones MC- 

15 A(L) and MC-A(R) are respectively converted into analog 
signals at D/A converters 26 and 28 via a stereo echo 
canceller 24, and respectively reproduced at the loudspeakers 
SP-B(L) and SP-B (R) . Collected audio signals of the 
microphones MC-B(L) and MC-B (R) in the spot B are converted 

20 into digital signals at A/D converters 30 and 32 , 

respectively, and applied with echo cancel processing at the 
stereo echo canceller 24, then modulated at the CODEC 22 and 
transmitted to the spot A via the transmission line 20. The 
signals transmitted to the spot A are inputted into the CODEC 

25 18 where the collected audio signals of the microphones MC- 
B(L) and MC-B (R) are demodulated. These demodulated 
collected audio signals of the microphones MC-B (L) and MC- 
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B (R) are respectively converted into analog signals at D/A 
converters 34 and 3 6 via the stereo echo canceller 16, and 
respectively reproduced at the loudspeakers SP-A(L) and SP- 
A(R) . 
5 [0063] 

Fig. 1 shows a structural example in the stereo echo 
canceller 16, 24. Left/right two-channel stereo signals x L 
and x R transmitted from the spot on the counterpart side and 
inputted into line input ends LI (L) and LI (R) are outputted 

10 from sound output ends SO(L) and SO(R) as they are (i.e. not 
through sum/difference signal producing means 52) , and 
reproduced at loudspeakers SP(L) {representing SP-A(L) or SP- 
B (L) } and SP(R) {representing SP-A(R) or SP-B (R) } , 
respectively . 

15 [0064] 

Filer means 40-1 to 40-4 are formed by, for example, 
FIR filters. Of them, the filter means 40-1 is set with an 
impulse response corresponding to a transfer function between 
the loudspeaker SP(L) and a microphone MC(L) {representing 

20 MC-A(L) or MC-B(L)} and performs, using such an impulse 
response, a convolution calculation of a signal to be 
outputted from the sound output end S0(L), thereby producing 
an echo cancel signal EC1 corresponding to a signal obtained 
such that the signal outputted from the sound output end 

25 S0(L) is reproduced at the loudspeaker SP(L), collected by 
the microphone MC(L) and inputted into a sound input end 
SI (L) . The filter means 40-2 is set with an impulse response 
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corresponding to a transfer function between the loudspeaker 
SP(L) and a microphone MC(R) {representing MC-A(R) or MC- 
B (R) } and performs, using such an impulse response, a 
convolution calculation of a signal to be outputted from the 
5 sound output end SO(L), thereby producing an echo cancel 

signal EC2 corresponding to a signal obtained such that the 
signal outputted from the sound output end SO(L) is 
reproduced at the loudspeaker SP(L), collected by the 
microphone MC(R) and inputted into a sound input end SI (R) . 

10 The filter means 40-3 is set with an impulse response 

corresponding to a transfer function between the loudspeaker 
SP(R) and the microphone MC(L) and performs, using such an 
impulse response, a convolution calculation of a signal to be 
outputted from the sound output end SO(R), thereby producing 

15 an echo cancel signal EC3 corresponding to a signal obtained 
such that the signal outputted from the sound output end 
SO(R) is reproduced at the loudspeaker SP (R) , collected by 
the microphone MC(L) and inputted into the sound input end 
SI (L) - The filter means 40-4 is set with an impulse response 

2 0 corresponding to a transfer function between the loudspeaker 
SP(R) and the microphone MC(R) and performs, using such an 
impulse response, a convolution calculation of a signal to be 
outputted from the sound output end S0(R), thereby producing 
an echo cancel signal EC4 corresponding to a signal obtained 

2 5 such that the signal outputted from the sound output end 

SO(R) is reproduced at the loudspeaker SP(R), collected by 
the microphone MC(R) and inputted into the sound input end 
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SI(R) . 

[0065] 

An adder 44 performs a calculation of EC1+EC3 . An 
adder 46 performs a calculation of EC2+EC4. A subtracter 48 
5 subtracts an echo cancel signal EC1+EC3 from a collected 

audio signal of the microphone MC(L) inputted from the sound 
input end SI (L) , thereby to perform echo cancellation. A 
subtracter 50 subtracts an echo cancel signal EC2+EC4 from a 
collected audio signal of the microphone MC(R) inputted from 

10 the sound input end SI (R) , thereby to perform echo 

cancellation. These echo-canceled signals of the respective 
left and right channels are outputted from line output ends 
LO(L) and L0(R), respectively, and transmitted toward the 
spot on the counterpart side. 

15 [0066] 

The sum/difference signal producing means 52 
performs addition, using an adder 54, of the left/right two- 
channel stereo signals x L and x R inputted into the line input 
ends LI (L) and LI (R) so as to produce a sum signal x L +x R , 

2 0 while performs subtraction thereof using a subtracter 56 so 

as to produce a difference signal x L -x R (or it may also be x R - 
x L ) . In case of the left/right two-channel stereo signals, 
the sum signal x L +x R and the difference signal x L -x R are in 
general low in correlation therebetween, and frequently, 

25 approximately uncorrelated . Transfer function calculating 

means 58 implements a cross-spectrum calculation between the 
sum signal x L +x R and the difference signal x L -x R produced by 



- 118 - 



the sum/difference signal producing means 52 and signals e L 
and e R output ted from the subtracters 4 8 and 50 and, based on 
this cross-spectrum calculation, sets filter characteristics 
(impulse responses) of the filter means 40-1 to 40-4. 
5 Specifically, upon starting the system, the filter 

characteristics of the filter means 40-1 to 40-4 are not set, 
i.e. coefficients are all set to zero, so that the echo 
cancel signals EC1 to EC4 are zero, and thus the collected 
audio signals of the microphones MC(L) and MC (R) themselves 

10 are outputted from the subtracters 48 and 50. Therefore, at 
this time, the transfer function calculating means 58 
performs the cross- spectrum calculation between the sum 
signal x L +x R and the difference signal x L -x R produced by the 
sum/difference signal producing means 52 and the collected 

15 audio signals e L and e R of the microphones MC(L) and MC(R) 

outputted from the subtracters 48 and 50 and, based on this 
cross-spectrum calculation, derives transfer functions of 
four audio transfer systems between the loudspeakers SP(L) 
and SP(R) and the microphones MC(L) and MC(R), respectively, 

20 and implements initial setting of the filter characteristics 
of the filter means 40-1 to 40-4 to values corresponding to 
such transfer functions. After the initial setting, since 
the echo cancel signals are produced by the filter means 40-1 
to 40-4, the echo cancel error signals e L and e R 

25 corresponding to difference signals between the collected 

audio signals of the microphones MC(L) and MC (R) and the echo 
cancel signals EC1 to EC4 are outputted from the subtracters 



- 119 - 



48 and 50. Therefore, at this time, the transfer function 
calculating means 58 performs the cross-spectrum calculation 
between the sum signal x L +x R and the difference signal x L -x R 
produced by the sum/difference signal producing means 52 and 
5 the echo cancel error signals e L and e R outputted from the 
subtracters 48 and 50 and, based on this cross-spectrum 
calculation, derives estimated errors of the transfer 
functions of the four audio transfer systems between the 
loudspeakers SP(L) and SP (R) and the microphones MC(L) and 

10 MC(R), respectively, and updates the filter characteristics 
of the filter means 40-1 to 40-4 to values that cancel the 
estimated errors, respectively. By repeating this updating 
operation per prescribed time period, the echo cancel error 
can be converged to a minimum value. Further, even if the 

15 transfer functions change due to movement of the microphone 

positions or the like, the echo cancel error can be converged 
to a minimum value by sequentially updating the filter 
characteristics of the filter means 40-1 to 40-4 depending 
thereon . 

20 [0067] 

Correlation detecting means 60 detects a correlation 
between the sum signal x L +x R and the difference signal x L -x R 
based on a correlation value calculation or the like, and 
stops updating of the foregoing filter characteristics when 

25 the correlation value is no less than a prescribed value. 

When the correlation value becomes lower than the prescribed 
value, updating of the foregoing filter characteristics is 
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restarted. Incidentally, as a concrete technique for 
deriving the correlation between the sum signal x L +x R and the 
difference signal x L -x R/ . any of the known techniques for 
detecting a correlation of two signals may be used. 
5 [0068] 

Herein, the filter characteristics (impulse 
responses) that are set to the filter means 40-1 to 40-4 by 
the transfer function calculating means 58 will be described. 
In the following description, the transfer functions and the 
10 filter characteristics are expressed using the following 
symbols . 

• Hxx : a transfer function (frequency-axis 
expression) 

•h^ : an impulse response (time-axis expression) 
15 corresponding to Hxx 

"H^xx: an estimated transfer function (a transfer 

function set to a filter) 
•h^xx- an impulse response corresponding to H^xx 
•AHxx- a transfer function estimated error 
2 0 "Ahxx: an impulse response corresponding to AHxx 

(note: Suitable symbols are allocated to xx.) 
The sum signal and the difference signal are 
respectively defined as follows. 

sum signal : x M (= x L +x R ) 

25 difference signal : x s (= x L -x R ) 

The transfer functions of the four audio transfer 
systems between the loudspeakers SP(L) and SP(R) and the 
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microphones MC(L) and MC(R) are respectively defined as 
follows . 

H LL : a transfer function of the system from the 

loudspeaker SP(L) to the microphone MC(L) 
H LR : a transfer function of the system from the 

loudspeaker SP(L) to the microphone MC(R) 
Hrl : a transfer function of the system from the 

loudspeaker SP(R) to the microphone MC(L) 
Hrr: a transfer function of the system from the 

loudspeaker SP(R) to the microphone MC(R) 
The input signals x L and x R at the line input ends 
LI (L) and LI (R) of the stereo echo canceller 16, 24 are 
replaced by 

x L = (x M +x s ) /2 
x R = (Xm-Xs) /2 . 
[0069] 

Then, a calculation shown below is carried out in 
the transfer function calculating means 58. The signals e L 
and e R {the collected audio signals of the microphones MC(L) 
and MC(R) as they are before the initial setting of the 
filter means 40-1 to 40-4 whereas the echo cancel error 
signals after the initial setting} outputted from the 
subtracters 48 and 50, assuming that frequency-axis 
expressions of the signals x M , x S/ e L and e R are respectively 
given as X M/ X s , E L and E R , become 

E L = { (X M +X s )H LL /2} + { (X M -X s )H RL /2} 

E R = { (X M +X s )H LR /2} + { (X M -X s )H RR /2} 
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hence 

2 El = (X M +X S )H LL + (X m -X s )Hri, 

= Xm (Hll+Hrl) + (Hll~Hrj j ) ... (1) 

2Er = (Xm+X s ) H L r + (X m -X s )Hrr 
5 — Xm(Hlr+Hrr) + Xs (Hlr-Hrr) . . . (2) . 

When both sides of the equation (1) are multiplied 
by complex conjugates X M * and X s * of X M and X s (i.e. deriving 
cross spectra) and ensemble -averaged, 

SX M *-2E L = 2X M *-X M (H LL +HR Ii ) + ZX M * • X s (Hll-Hrl) ...(3) 
10 £X S *-2E L = SX s *-Xm(H ll +Hrl) + EX S * • X s (Hll-Hrl) ...(4) 

are respectively obtained. Similarly, when both sides of the 
equation (2) are multiplied by complex conjugates X M * and X s * 
of X M and X s , 

SX M *-2E R = 2X m *-Xm(H L r+Hrr) + EX m *-X s (H lr -Hrr) ...(5) 
15 £X s *-2Er = SX s *-X m (H L r+Hrr) + 2X S * • X s (H lr -Hrr) ...(6) 

are respectively obtained. 
[0070] 

In the equations (3) to (6) , since X M and X s are 
approximately uncorrelated with each other, such a term 
20 having X M *-X S or X S *-X M becomes approximately zero when 
ensemble - averaged . Further , 

X M *-X M = |X M | 2 

X s *-X s = | X s | 2 

hence, the equations (3) to (6) respectively become 
25 £X M *-2E L = S |X M | 2 (H ll +Hrl) . . . (3 » ) 

SX S *-2E L = S |X S | 2 (Hll-Hrl) . . . (4 ' ) 
SX M *-2E R = S|X m | 2 (H lr +Hrr) ...(5') 
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2X S *-2E R = S (XsI^Hlr-Hrr) ...(6'). 

By transforming the equations (3') to (6')/ the 
following composite transfer functions each in the form of 
combination of two transfer functions are respectively 
derived. 

Hll+Hrl = £X M * • 2E L /S | X M | 2 . . . ( 3 " ) 
H LL -H RL = SX S *-2E L /S|X S | 2 ...(4") 
Hlr+Hrr = SX M *-2E R /S|X M | 2 ...(5") 
Hlr-Hrr = £X s *-2Er/2|X s | 2 ...(6") 

When the corresponding sides of the equations (3") 
and (4") are added together, 

H LL = (SX M *-E L /S |X M | 2 ) + (SX S *-E L /2|X S | 2 ) ...(7). 

When subtraction is performed between the 
corresponding sides of the equations (3 M ) and (4"), 

Hrl = (SX M *-E L /S |X M | 2 ) - (SX S *-E L /S|X S | 2 ) ...(8). 

When the corresponding sides of the equations (5") 
and (6") are added together, 

Hlr = (2X M *-E R /2 |X M | 2 ) + (£X s *-Er/2 |X S | 2 ) ...(9). 

When subtraction is performed between the 
corresponding sides of the equations (5") and (6"), 

Hrr = (ZX M *-E R /£ |X M | 2 ) - (SX S *-E R /S |X S | 2 ) ...(10). 

Hence, transfer functions H LL/ H RL/ H L r and Hrr are 
respectively derived. Impulse responses h LL/ h RL , h LR and h^ 
obtained by applying the inverse Fourier transformation to 
these derived transfer functions are the filter 
characteristics to be set to the filter means 40-1, 40-2, 40 
3 and 40-4, respectively. Therefore, the transfer function 
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calculating means 58 derives the respective transfer 
functions H LL , Hrl, H lr and H RR from the equations (7) to (10) 
based on the sum signal x M/ the difference signal x s , and the 
output signals e L and e R of the subtracters 48 and 50 that 
are inputted, derives the impulse responses h LL , h RL/ h LR and 
h-KR by applying the inverse Fourier transformation to those 
derived transfer functions, sets the derived impulse 
responses to the filter means 40-1, 40-2, 40-3 and 40-4 as 
h^LL/ h^ RL/ h^ LR and h^, respectively, and further, updates 
the impulse responses by repeating this calculation per 
suitably determined prescribed time period (e.g. time period 
of performing ensemble averaging) . 
[0071] 

When the foregoing impulse response updating 
technique is explained using estimated error parameters, it 
becomes as follows. Output signals (collected audio signals) 
y L and y R of the microphones MC(L) and MC(R), assuming that 
frequency-axis expressions of x L , x R , y L and y R are 
respectively given as X L , X R , Y L and Y R , become 

Y L = (X M +X s )H LL /2 + (X M -X s )H RL /2 

Y R = (X M +X s )H LR /2 + (X M -X s )H RR /2. 

The signal E L output ted from the subtracter 4 8 

becomes 

El = Yl - (X L "H ll +.X R *H R l) 

= { (X M +X s )H LL /2 + (X„-X s )H RL /2} - { (X„+X s )H" LL /2 
+ (X M -X s )H" RL /2} 

hence 
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2 El = Xm (Hll+Hrl~H ll~H rl ) + Xs (Hll-Hrl-H ll+H rl ) ... (11) . 
The signal E R outputted from the subtracter 50 

becomes 

Er — Y R - (Xl'H^lR + Xr'H rr) 

5 = { (Xm+X s )H lr /2 + (X m -X s )Hrr/2} - { (X M +X S ) H" LR /2 

+ (X m -X s )H"rr/2} 

hence 

2Er = Xm (Hlr+Hrr-H^lr-H^rr) + X s (Hlr-Hrr-H lr+H rr) ... (12) . 
[0072] 

10 When the estimated errors are given as 



AH LL 


~ Hll * 




AHrl 


= Hrl - 


" H^rl 


AH LR 


= H L r - 


" H^ L R 


AHrr 


= Hrr " 


- H^RR 



15 the equations (11) and (12) become 

2 El = Xm (AHll+AHrl) + X s (AH LL -AH RL ) ...(11') 

2E R = XmCAHlr+AHrr) + X s (AH L r-AHrr) . . . (12' ) . 

When both sides of the equation (11') are multiplied 

by complex conjugates X M * and X s * of X M and X s (i.e. deriving 
20 cross spectra) and ensemble -averaged, 

SX M *-2E L = SX„* -X M ( AHll+AHrl) + SX M * • X s ( AH LL - 

AHrl) . . - (13) 

SXs*-2E L = SX S *-X M (AHll+AHrl) + SX S * • X s ( AH LL - 

AHrl) ... (14) 

25 are respectively obtained. Similarly, when both sides of the 
equation (12') are multiplied by complex conjugates X M * and 
X s * of X M and X s , 
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SX M *-2E R = 2X M * -XmCAHlr+AHrr) + SX M * • X s ( AHlr- 
AHrr) . . . (15) 

£X S *-2E R = SX S * " X M ( AH L r+AHrr) + SX S * • X s ( AH LR - 
AHrr) . . . (16) 
5 are respectively obtained. 

[0073] 

In the equations (13) to (16) , since X M and X s are 
approximately uncorrelated with each other, such a term 
having X M *-X S or X S *-X M becomes approximately zero when 
10 ensemble -averaged. Further, 

X M *-X M = |X M | 2 

X s *-X s = | X s | 2 

hence, the equations (13) to (16) respectively become 





SX„*- 


2 El = S |X„| 2 (AHll+AHrl) 


. . . (13 ' ) 




15 


SX S * ' 


2 El = S |X S | 2 (AHll-AHrl) 


... (14') 






sx„*- 


2E r = S|X„| 2 (AHlr+AHrr) 


. . . (15 ' ) 






SX S *- 


2E r = S|X s | 2 (AHlr-AHrr) 


. . . (16 ' ) . 






From 


the equations (13 ') to (16 1 ), 






AH LL 


= SX M *-E L /S 1 X M 1 2 + SX S * 


•E L /S|Xs| 2 . 


. . (17) 


20 


AHrl 


= SX M *-E L /S |X m | 2 - SX S * 


•El/S|X s | 2 . 


. . (18) 




AH LR 


= SX M *-E R /S 1 X M | 2 + SX S * 


•Er/s|x s | 2 . 


. . (19) 




AHrr 


= SX„*-E R /S|X M | 2 - SX S * 


•Er/s|x s | 2 . 


. . (20) 



are respectively derived. 



[0074] 

25 Using the estimated errors AH LL/ AHrl, AH lr and AHrr 

derived from the equations (17) to (20) , the filter 
characteristics of the filter means 40-1, 40-2, 40-3 and 40-4 
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are updated per suitably determined prescribed time period 
(e.g. time period of performing ensemble averaging) . For 
example, assuming that impulse responses h LL , Iirl, h L R and Iirr 
after K-th updating are given as h LL (k) , h^k), h LR (k) and 
5 hRR(k) , using impulse responses Ah LL , AIirl, Ah LR and Ah^ 

corresponding to the derived estimated errors AH LL/ AH RL/ AH LR 
and AH^, 

h LL (k+l) = h LL (k) + aAh LL ...(21) 
h RL (k+l) = h^k) + aAh RL ...(22) 
10 h LR (k+l) = h LR (k) + aAh LR ...(23) 

hRR(k+l) = hRR(k) + aAhRR ...(24) 
where a is a suitably set convergence coefficient. 

Using these updating equations, (k+l)th impulse 
responses h LL (k+l) / h RL (k-fl) / h LR (k+l) and h^k+l) are derived 
15 and set to the filter means 40-1, 40-2, 40-3 and 40-4, 
respectively, which is repeated per suitably determined 
prescribed time period (e.g. time period of performing 
ensemble averaging) . 
[0075] 

20 The results of carrying out simulations using the 

signals x L and x R or the sum and difference signals x M and x s 
as reference signals with respect to the stereo echo 
canceller 16, 24 of Fig. 1, are shown in Figs. 3 to 8 for 
each of the audio transfer systems. As the signals x L and x R , 

2 5 stereo audio signals based on a human voice were used. In 

Figs. 3 to 8, the axis of abscissas represents the number of 
blocks (one block represents a time period of performing 
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ensemble averaging and is set to about 2.3 seconds in the 
simulations) , and the filter characteristics are updated per 
block. The filter characteristics are not set in the first 
block, the initial setting is executed in the second block, 
5 then updating is carried out per block. The axis of 

ordinates represents the estimated error (dB) of the transfer 
function, and the initial state where the filter 
characteristics are not set is defined as OdB. Table 1 shows 
the conditions of the respective simulations of Figs. 3 to 8 . 
10 [Table 1] 



Figure Number 


Reference Signal 


Double Talk 


Change in 
Transfer System 


FIG. 3 


X L / X R 


NO 


NO 


FIG. 4 


Xm/ Xs 


NO 


NO 


FIG. 5 


Xl / Xr 


YES 


NO 


FIG. 6 


X M/ Xs 


YES 


NO 


FIG. 7 


Xl / Xr 


YES 


YES 


FIG. 8 


Xm / Xs 


YES 


YES 



[0076] 

In Table 1, "Double Talk" represents the state where 
15 sounds reproduced from the loudspeakers SP (L) and SP(R) as 

well as a voice uttered by a person present in that room are 
simultaneously collected by the microphones MC(L) and MC(R). 
Since the teleconferencing system is used in general in the 
state where the double talk occurs, it is required that the 
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sufficient echo cancel performance can be achieved even when 
the double talk occurs. On the other hand, in Table 1, 
"Change in Transfer System" represents changing the transfer 
functions while the estimated errors are converging, 
5 supposing a case where the microphone positions are moved, or 
the like. As an operation of the echo canceller, it is 
required that even if the estimated error temporarily 
increases due to change in transfer function, it again go 
toward convergence . 

10 [0077] 

The simulation results of Figs. 3 to 8 are 
considered. Comparing Figs. 3 and 4, when the signals x L and 
x R are used as reference signals (Fig. 3) , because a 
correlation between the signals x L and x R is high, estimated 

15 errors only drop to about -15 to -25dB at most in the 20th 

block (about 45 seconds) from the start of the operation. In 
contrast, when the signals x M and x s are used as reference 
signals (Fig. 4) , because a correlation between the signals 
x M and x s is low, estimated errors drop to about -35 to -45dB 

20 in the 20th block from the start of the operation, and thus 

it is seen that the sufficient echo cancel performance can be 
achieved. Comparing Figs. 5 and 6 showing cases where the 
double talk exists, when the signals x L and x R are used as 
reference signals (Fig. 5) , estimated errors only drop to 

25 about -10 to -20dB at most in the 20th block from the start 
of the operation. In contrast, when the signals x M and x s 
are used as reference signals (Fig. 6) , estimated errors drop 
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to about -23 to -3 0dB, and thus it is seen that even if the 
double talk exists, the sufficient echo cancel performance 
can be achieved. This is because, since a voice uttered by a 
person present in a room is uncorrelated with sounds 
5 reproduced from the loudspeakers SP(L) and SP (R) , when the 
transfer function calculating means 58 calculates transfer 
functions of the respective systems, components of the voice 
uttered by the person in the room are canceled through the 
foregoing cross- spectrum calculation and ensemble -average 

10 calculation, so that the transfer functions of the respective 
systems can be derived with no influence of the double talk. 
Comparing Figs. 7 and 8 showing cases where the double talk 
and the change of the transfer systems are present, when the 
signals x L and x R are used as reference signals (Fig. 7), 

15 convergence of estimated errors is poor after giving a change 
to the transfer systems in the 11th block so that the 
estimated errors only drop to about -5 to -15dB at most in 
the 2 0th block. In contrast, when the signals x M and x s are 
used as reference signals (Fig. 8), convergence of estimated 

20 errors is excellent even after the change is given to the 
transfer systems in the 11th block so that the estimated 
errors drop to about -17 to -2 6dB in the 2 0th block, and thus 
it is seen that even if the double talk and the change of the 
transfer systems exist, the sufficient echo cancel 

25 performance can be achieved. 
[0078] 

Fig. 9 shows another structural example in the 
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stereo echo canceller 16, 24 of Fig. 2, wherein 
sum/difference signal producing means is arranged on 
transmission lines to loudspeakers. The same symbols are 
used with respect to those portions common to the foregoing 
5 structure of Fig. 1. Left/right two-channel stereo signals 
x L and x R transmitted from the spot on the counterpart side 
and inputted into line input ends LI (L) and LI (R) are 
inputted into sum/difference signal producing means 52 . The 
sum/difference signal producing means 52 performs addition of 

10 the stereo signals x L and x R using an adder 54 so as to 

produce a sum signal x M (= x L +x R ) , while performs subtraction 
thereof using a subtracter 56 so as to produce a difference 
signal x s {= x L -x R (or it may also be x R -x L ) } . Stereo audio 
signal demodulating means 62 performs addition of the sum and 

15 difference signals x M and x s using an adder 64, and further, 
gives thereto a coefficient 1/2 using a coefficient 
multiplier 66 to recover the original signal x L/ while 
performs subtraction of the sum and difference signals x M and 
x s using a subtracter 68, and further, gives thereto a 

20 coefficient 1/2 using a coefficient multiplier 70 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends S0(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 
[0079] 

25 Transfer function calculating means 58 implements a 

cross-spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
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producing means 52 and signals e L and e R outputted from 
subtracters 48 and 50 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
characteristics of filter means 40-1 to 40-4. Operations 
5 thereof are the same as those described with respect to the 
structure of Fig. 1. Operations of the other portions are 
also the same as those described with respect to the 
structure of Fig. 1. 
[0080] 

10 Fig. 10 shows another structural example in the 

stereo echo canceller 16, 24 of Fig. 2, wherein transmission 
is implemented between the spots A and B of Fig. 2 in the 
signal form of the sum signal x M and the difference signal x s , 
instead of the signal form of the stereo signals x L and x R . 

15 The same symbols are used with respect to those portions 
common to the foregoing structure of Fig. 1 or 9. A sum 
signal x M (= x L +x R ) and a difference signal x s {= x L -x R (or it 
may also be x R -x L ) } transmitted from the spot on the 
counterpart side and inputted into line input ends LI (L) and 

2 0 LI (R) are inputted into stereo audio signal demodulating 
means 62. The stereo audio signal demodulating means 62 
performs addition of the sum and difference signals x M and x s 
using an adder 64, and further, gives thereto a coefficient 
1/2 using a coefficient multiplier 66 to recover the original 

25 signal x L , while performs subtraction of the sum and 

difference signals x M and x s using a subtracter 68, and 
further, gives thereto a coefficient 1/2 using a coefficient 



- 133 - 



multiplier 70 to recover the original signal x R . The 
recovered signals x L and x R are outputted from sound output 
ends SO(L) and SO(R) and reproduced at loudspeakers SP(L) and 
SP(R) , respectively. 
5 [0081] 

Transfer function calculating means 58 implements a 
cross-spectrum calculation between the sum signal x M and the 
difference signal x s inputted from the line input ends LI (L) 
and LI (R) and signals e L and e R outputted from subtracters 48 

10 and 50 and, based on this cross -spectrum calculation, 

performs setting and updating of filter characteristics of 
filter means 40-1 to 40-4. Operations thereof are the same 
as those described with respect to the structure of Fig. 1 or 
9. Sum/difference signal producing means 72 performs 

15 addition, using an adder 73, of the signals e L and e R 

outputted from the subtracters 4 8 and 50 so as to produce a 
sum signal e M (= e L +e R ) , while performs subtraction thereof 
using a subtracter 75 so as to produce a difference signal e s 
{= e L -e R (or it may also be e R -e L ) } , then sends them toward 

2 0 the spot on the counterpart side. Operations of the other 

portions are the same as those described with respect to the 
structure of Fig. 1 or 9 . 
[0082] 

Fig. 11 shows another structural example in the 
25 stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 
structure of Fig. 1, 9 or 10. A sum signal x M (= x L +x R ) and a 
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difference signal x s {= x L -x R (or it may also be x R -x L ) } 
transmitted from the spot on the counterpart side and 
inputted into line input ends LI (L) and LI (R) are inputted 
into stereo audio signal demodulating means 62 . The stereo 
5 audio signal demodulating means 62 performs addition of the 
sum and difference signals x M and x s using an adder 64, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 66 to recover the original signal x L , while 
performs subtraction of the sum and difference signals x M and 

10 x s using a subtracter 68, and further, gives thereto a 

coefficient 1/2 using a coefficient multiplier 70 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends SO(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 

15 [0083] 

Sum/difference signal producing means 52 performs 
addition, using an adder 54, of the stereo signals x L and x R 
recovered by the stereo audio signal demodulating means 62 so 
as to produce a sum signal x M (= x L +x R ) , while performs 

2 0 subtraction thereof using a subtracter 56 so as to produce a 
difference signal x s {= x L -x R (or it may also be x R -x L ) } . 
Transfer function calculating means 58 implements a cross- 
spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 

25 producing means 52 and signals e L and e R outputted from 
subtracters 48 and 50 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
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characteristics of filter means 40-1 to 40-4. Operations 
thereof are the same as those described with respect to the 
structure of Fig. 1, 9 or 10 . Sum/difference signal 
producing means 72 performs addition, using an adder 73, of 
5 the signals e L and e R output ted from the subtracters 4 8 and 

50 so as to produce a sum signal e M (= e L +e R ) , while performs 
subtraction thereof using a subtracter 75 so as to produce a 
difference signal e s {= e L -e R (or it may also be e R -e L ) } , then 
sends them toward the spot on the counterpart side . 
10 Operations of the other portions are the same as those 

described with respect to the structure of Fig. 1, 9 or 10. 
[0084] 

Fig. 12 shows a structural example in the stereo 
echo canceller 16, 24. Left/right two-channel stereo signals 

15 x L and x R transmitted from the spot on the counterpart side 
and inputted into line input ends LI (L) and LI (R) are 
outputted from sound output ends SO(L) and SO(R) as they are 
(i.e. not through sum/difference signal producing means 152), 
and reproduced at loudspeakers SP(L) and SP(R), respectively. 

20 [0085] 

The sum/difference signal producing means 152 
performs addition, using an adder 154, of the left/right two- 
channel stereo signals x L and x R inputted into the line input 
ends LI (L) and LI (R) so as to produce a sum signal x M (= 

25 x L +x R ) , while performs subtraction thereof using a subtracter 
156 so as to produce a difference signal x s {= x L -x R (or x R - 
x L ) } . 
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[0086] 

Filer means 140-1 to 140-4 are formed by, for 
example, FIR filters. These filter means 140-1 to 140-4 are 
each set with an impulse response corresponding to a 
5 composite transfer function in the form of combination of 
transfer functions of suitable two systems among transfer 
functions H LL , H LR , Hrl and Hrr of four audio transfer systems 
between the loudspeakers SP(L) and SP(R) and microphones 
MC(L) and MC(R), respectively, and perform a convolution 
10 calculation of the sum and difference signals (low- 
correlation composite signals) using such impulse responses, 
thereby producing echo cancel signals EC1 to EC4 , 
respectively . 

[0087] 

15 An adder 144 performs a calculation of EC1+EC3 . An 

adder 146 performs a calculation of EC2+EC4 . A subtracter 
148 subtracts an echo cancel signal EC1+EC3 from a collected 
audio signal of the microphone MC(L) inputted from a sound, 
input end SI (L.) , thereby to perform echo cancellation. A 

20 subtracter 150 subtracts an echo cancel signal EC2+EC4 from a 
collected audio signal of the microphone MC(R) inputted from 
a sound input end SI (R) , thereby to perform echo cancellation. 
These echo-canceled signals of the respective left and right 
channels are outputted from line output ends LO(L) and LO(R), 

25 respectively, and transmitted toward the spot on the 
counterpart side. 
[0088] 
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Transfer function calculating means 158 implements a 
cross -spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
producing means 152 and signals e L and e R outputted from the 
subtracters 148 and 150 and, based on this cross- spectrum 
calculation, performs setting and updating of filter 
characteristics (impulse responses) of the filter means 140-1 
to 140-4. Specifically, upon starting the system, the filter 
characteristics of the filter means 140-1 to 140-4 are not 
set, i.e. coefficients are all set to zero, so that the echo 
cancel signals EC1 to EC4 are zero, and thus the collected 
audio signals of the microphones MC(L) and MC(R) themselves 
are outputted from the subtracters 148 and 150. Therefore, 
at this time, the transfer function calculating means 158 
performs the cross -spectrum calculation between the sum 
signal x M and the difference signal x s produced by the 
sum/difference signal producing means 152 and the collected 
audio signals e L and e R of the microphones MC(L) and MC(R) 
outputted from the subtracters 148 and 150 and, based on this 
cross -spectrum calculation, derives a plurality of composite 
transfer functions each in the form of combination of 
transfer functions of suitable two systems among transfer 
functions H LL/ H LR/ H RL and of four audio transfer systems 
between the loudspeakers SP(L) and SP (R) and the microphones 
MC(L) and MC(R), respectively, and implements initial setting 
of the filter characteristics of the filter means 140-1 to 
140-4 to values corresponding to such composite transfer 
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functions. After the initial setting, since the echo cancel 
signals are produced by the filter means 140-1 to 140-4, the 
echo cancel error signals e L and e R corresponding to 
difference signals between the collected audio signals of the 
5 microphones MC(L) and MC(R) and the echo cancel signals EC1 
to EC4 are outputted from the subtracters 148 and 150. 
Therefore, at this time, the transfer function calculating 
means 158 performs the cross-spectrum calculation between the 
sum signal x M and the difference signal x s produced by the 

10 sum/difference signal producing means 152 and the echo cancel 
error signals e L and e R outputted from the subtracters 14 8 
and 150 and, based on this cross-spectrum calculation, 
derives estimated errors of the foregoing composite transfer 
functions, respectively, and updates the filter 

15 characteristics of the filter means 140-1 to 140-4 to values 
that cancel such estimated errors, respectively. By 
repeating this updating operation per prescribed time period, 
the echo cancel error can be converged to a minimum value. 
Further, even if the transfer functions change due to 

20 movement of the microphone positions or the like, the echo 
cancel error can be converged to a minimum value by 
sequentially updating the filter characteristics of the 
filter means 140-1 to 140-4 depending thereon. 
[0089] 

25 Correlation detecting means 160 detects a 

correlation between the sum signal x M and the difference 
signal x s based on a correlation value calculation or the 
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like, and stops updating of the foregoing filter 
characteristics when the correlation value is no less than a 
prescribed value. When the correlation value becomes lower 
than the prescribed value, updating of the foregoing filter 
characteristics is restarted. 
[0090] 

Herein, the filter characteristics (impulse 
responses) that are set to the filter means 140-1 to 140-4 by 
the transfer function calculating means 158 will be described. 
In the transfer function calculating means 158, the following 
calculation is performed. 
(In case of Fixed Type Operation) 

The signals x L and x R are 

x L = (x M +x s )/2 

x R = (x M -x s ) /2 

hence, output signals Y L and Y R of the microphones MC(L) and 
MC(R) become 

Y L = (X M +X s )H LL /2 + (X M -X s )H RL /2 

= X M { (H LL +H RL ) /2} + X s { (Hll-Hrl) /2 } ...(25) 

Y R = (X M +X s )H LR /2 + (X„-X s )H RR /2 

= X M { (Hlr+HrrJ/2} + X s { (H LR -H RR )/2} ...(26). 

When the composite transfer functions are given as 

Hml ~ (Hll+Hj^l) /2 

Hsl = (Hll~H R l)/2 

Hmr - (Hl R +H rr )/2 

Hsr = (H LR -H RR )/2 
the equations (25) and (26) respectively become 
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Y L = X„-H ML + Xs-H SL ...(25') 
Y R = Xm-Hmr + X S -H S R ...{26'). 

When both sides of the equations (25') and (26') are 
multiplied by complex conjugates X M * and X s * of X M and X s and 
ensemble -averaged, 

2X M *-Y L = SX M *-X M -H ML + SX m *-X s -Hsl ...(27) 
SXs*-Y L - 2X S *-X M -H ML + SX s *-Xs-Hsl ...(28) 
SX M *-Y R = EX M *- Xm-Hmr + 2X„* • X s ■ H SR ...(29) 
SX s *-Yr = SXs*-X m -H„r + SXs*-X s -H SR ...(30) 
are respectively obtained. 
[0091] 

In the equations (27) to (30) , since X M and X s are 
approximately uncorrelated with each other, such a term 
having X M *'X S or X S *-X M becomes approximately zero when 
ensemble -averaged. Further, 

X M *-X„ = |X„| 2 

X s *-X s = | X s | 2 

hence, the equations (27) to (30) respectively become 
SX„*-Y L = 2|X M | 2 H ML ...(27') 
SX S *-Y L = 2|Xs| 2 H SL ...(28') 
SX„*-Y R = S|X m | 2 H„r ... (29' ) 
2X S *-Y R = S|X S | 2 H SR ... (30') . 
From the equations (27') to (30'), 
H ML = SX M *-Y L /2 |X„| 2 ...(31) 
H SL = SX S *-Y L /S | X s | 2 ...(32) 
Hmr = SXm*-Yr/2|X„| 2 . . . (33) 
H SR = SX s *-Y R /2 |X S | 2 ...(34) 



- 141 - 



are respectively derived. 
[0092] 

Impulse responses h ML , h SL , Iimr and h SR obtained by 
applying the inverse Fourier transformation to these derived 
5 composite transfer functions H ML/ H S l/ H mr and H S r are the 

filter characteristics to be set to the filter means 140-1, 
140-2, 140-3 and 140-4, respectively. Therefore, the 
transfer function calculating means 158 derives the 
respective composite transfer functions H M l/ H S l/ H mr and H S r 

10 from the equations (31) to (34) based on the sum signal x M , 

the difference signal x S/ and output signals y L and y R of the 
microphones MC(L) and MC(R) that are inputted, derives the 
impulse responses h ML , h S L/ h^ and h SR by applying the inverse 
Fourier transformation to those derived composite transfer 

15 functions, sets the derived impulse responses to the filter 
means 140-1, 140-2, 140-3 and 140-4, respectively, and 
further, updates the impulse responses by repeating this 
calculation per suitably determined prescribed time period 
(e.g. time period of performing ensemble averaging). 

20 [0093] 

(In case of Adaptive Type Operation) 

Assuming that the filter characteristics set to the 
filter means 14 0-1, 14 0-2, 14 0-3 and 14 0-4 are given as H^ ML , 
H^sl/ and H^ S r (h^ML/ h^sL/ h^MR and h^ SR when expressed in 

25 terms of the impulse responses) , the signals e L and e R 
outputted from the subtracters 14 8 and 150 become 
El = Yl ~ (Xm " H^ml+Xs * H^sl) 
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= { (X M +X s )H LL /2+ (X M -X s )H RL /2} - (X m -H" ml +X s -H"sl) 
= X M { (H LL +H RL ) /2-H"ml} + X s { (Hll-Hrl) /2-H" SL } ...(35) 
Er = Y R - (X M • H^ M r+Xs • H^ S r) 

= { (Xm+Xs)H L r/2+(X m -Xs)Hrr/2} - (X m -H" mr +X s -H"sr) 
5 = X M { (H L r+Hrr)/2-H" mr } + X s { (H lr -Hrr)/2-H" S r} ...(36). 

When the composite transfer functions are given as 
Hml = ( Hll + Hrl ) / 2 
Hsl = ( Hll ~ Hrl ) / 2 
Hmr = (Hlr + Hrr) /2 
10 Hsr = (Hlr - Hrr) /2 

the equations (35) and (36) respectively become 
El = Xm (Hml~H^ml) + Xg (Hsl~H^sl) • • • (35 ') 
Er = Xm (Hmr-H^mr) + Xs (Hsr-H sr) ... (36 1 ) . 
When the estimated errors of the composite transfer 
15 functions are given as 
AHml = Hml ~ H ml 
AHsl = Hsl ~ H sl 
A Hmr = Hmr - H^ MR 
AHsr = Hsr - H * S R 
20 the equations (35 f ) and (36') respectively become 
E L = Xm'AHml + Xs-AHsl ...(35") 
E R = Xm-AHmr + Xs'AHsr ...(36"). 

When both sides of the equations (35") and (36") are 
multiplied by complex conjugates X M * and X s * of X M and X s and 
2 5 ensembl e - ave raged , 

SX M *-E L = £X m *-X m -AHml + 2X M *-X S -AH SL ...(37) 
SX S *-E L = SX s *-X m -AHml + SX s *-X s -AHsl ...(38) 
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SX M *-E R = SX M *-X M -AH MR + 2X M *-Xs-AH S R ...(39) 
SX S *-E R = 2Xs*-Xm-AH mr + EX S *-X S -AH SR ...(40) 
are respectively obtained. 
[0094] 

5 In the equations (3 7) to (4 0) , since X M and X s are 

approximately uncorrelated with each other, such a term 
having X M *"X S or X S *-X M becomes approximately zero when 
ensemble -averaged . Further , 
X M *-X M = |X M | 2 
10 X s *-X s = |X S | 2 

hence, the equations (37) to (40) respectively become 



SX M *- 


E L = S|X„| 2 AH ML 


. . . (37' ) 


SX S *- 


E L = S | X s | 2 AH SL 


. . . (38* ) 


SX M *- 


E R = S | X M | 2 AHmr 


. . . (39' ) 


SX S *- 


E r = S | X s | 2 AH S r 


. . . (40' ) . 


From 


the equations 


(37') to (40 


AH ML 


= 2X M *-E L /S |X M | 2 


. - . (41) 


AHsl 


= SX S *-E L /S | X s | 2 


. . . (42) 


AH„r 


= SX„*-E R /S |X„| 2 


. . . (43) 


AH SR 


= SX S *-E R /S 1 X s 1 2 


. • • (44) 



are respectively derived. 
[0095] 

Using the estimated errors AH ML/ AH SL / AH MR and AH SR 
derived from the equations (40) to (44) , the filter 
25 characteristics of the filter means 140-1, 140-2, 140-3 and 
14 0-4 are updated per suitably determined prescribed time 
period (e.g. time period of performing ensemble averaging). 
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For example, assuming that impulse responses h ML , h SL , h^R and 
h S R after K-th updating are given as h M L(k), h S i,(k), h M R(k) and 
h S R(k), using impulse responses Ah M L# Ah SL , AIimr and Ah SR 
corresponding to the derived estimated errors AH ML , AH S l/ AHmr 
and AHsr, 

h ML (k+l) = h ML (k) + aAh ML ...(45) 
h SL (k+l) = h SL (k) + aAh SL ...(46) 
h-MR (k+1) = hwR(k) + aAh MR ...(47) 
h SR (k+l) = h SR (k) + aAh SR ...(48). 

Using these updating equations, (k+l)th impulse 
responses h ML (k+l), h SL (k+l), h MR (k+l) and h SR (k+l) are derived 
and set to the filter means 140-1, 140-2, 140-3 and 140-4, 
respectively, which is repeated per suitably determined 
prescribed time period (e.g. time period of performing 
ensemble averaging) . 
[0096] 

Fig. 13 shows another structural example in the 
stereo echo canceller 16, 24 of Fig. 2, wherein 
sum/difference signal producing means is arranged on 
transmission lines to loudspeakers. The same symbols are 
used with respect to those portions common to the foregoing 
structure of Fig. 12. Left/right two-channel stereo signals 
x L and x R transmitted from the spot on the counterpart side 
and inputted into line input ends LI (L) and LI (R) are 
inputted into sum/difference signal producing means 152. The 
sum/difference signal producing means 152 performs addition 
of the stereo signals x L and x R using an adder 154 so as to 
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produce a sum signal x M (= x L +x R ) , while performs subtraction 
thereof using a subtracter 156 so as to produce a difference 
signal x s {= x L -x R (or it may also be x R -x L ) } . Stereo audio 
signal demodulating means 162 performs addition of the sum 
5 and difference signals x M and x s using an adder 164, and 

further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 166 to recover the original signal x L , while 
performs subtraction of the sum and difference signals x M and 
x s using a subtracter 168, and further, gives thereto a 
10 coefficient 1/2 using a coefficient multiplier 170 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends S0(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 
[0097] 

15 Transfer function calculating means 158 implements a 

cross -spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
producing means 152 and signals e L and e R outputted from 
subtracters 148 and 150 and, based on this cross-spectrum 

20 calculation, performs setting and updating of filter 

characteristics of filter means 140-1 to 140-4. Operations 
thereof are the same as those described with respect to the 
structure of Fig. 12. Operations of the other portions are 
also the same as those described with respect to the 

25 structure of Fig. 12. 

[0098] 

Fig. 14 shows another structural example in the 
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stereo echo canceller 16, 24 of Fig. 2, wherein transmission 
is implemented between the spots A and B of Fig. 2 in the 
signal form of the sum signal x M and the difference signal x S/ 
instead of the signal form of the stereo signals x L and x R . 
5 The same symbols are used with respect to those portions 

common to the foregoing structure of Fig. 12 or 13. A sum 
signal x M (= x L +x R ) and a difference signal x s {= x L -x R (or it 
may also be x R -x L ) } transmitted from the spot on the 
counterpart side and inputted into line input ends LI (L) and 

10 LI (R) are inputted into stereo audio signal demodulating 

means 162. The stereo audio signal demodulating means 162 
performs addition of the sum and difference signals x M and x s 
using an adder 164, and further, gives thereto a coefficient 
1/2 using a coefficient multiplier 166 to recover the 

15 original signal x L , while performs subtraction of the sum and 
difference signals x M and x s using a subtracter 168, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 170 to recover the original signal x R . The 
recovered signals x L and x R are outputted from sound output 

20 ends SO(L) and SO(R) and reproduced at loudspeakers SP(L) and 
SP(R) , respectively. 
[0099] 

Transfer function calculating means 158 implements a 
cross- spectrum calculation between the sum signal x M and the 
25 difference signal x s inputted from the line input ends LI (L) 
and LI (R) and signals e L and e R outputted from subtracters 
148 and 150 and, based on this cross-spectrum calculation, 
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performs setting and updating of filter characteristics of 
filter means 140-1 to 140-4. Operations thereof are the same 
as those described with respect to the structure of Fig. 12 
or 13. Sum/difference signal producing means 172 performs 
5 addition, using an adder 173, of the signals e L and e R 

output ted from the subtracters 14 8 and 15 0 so as to produce a 
sum signal e M (= e L +e R ) , while performs subtraction thereof 
using a subtracter 175 so as to produce a difference signal 
£s {= e L-e R (or it may also be e R -e L ) } , then sends them toward 
10 the spot on the counterpart side. Operations of the other 

portions are the same as those described with respect to the 
structure of Fig. 12 or 13. 
[0100] 

Fig. 15 shows another structural example in the 
15 stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 
structure of Fig. 12, 13 or 14 . A sum signal x M (= x L +x R ) and 
a difference signal x s {= x L -x R (or it may also be x R -x L ) } 
transmitted from the spot on the counterpart side and 
20 inputted into line input ends LI (L) and LI (R) are inputted 

into stereo audio signal demodulating means 162. The stereo 
audio signal demodulating means 162 performs addition of the 
sum and difference signals x M and x s using an adder 164, and 
further, gives thereto a coefficient 1/2 using a coefficient 
25 multiplier 166 to recover the original signal x L , while 

performs subtraction of the sum and difference signals x M and 
x s using a subtracter 168, and further, gives thereto a 
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coefficient 1/2 using a coefficient multiplier 170 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends SO(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 
5 [0101] 

Sum/difference signal producing means 152 performs 
addition, using an adder 154, of the stereo signals x L and x R 
recovered by the stereo audio signal demodulating means 162 
so as to produce a sum signal x M (= x L +x R ) , while performs 

10 subtraction thereof using a subtracter 156 so as to produce a 
difference signal x s {= x L -x R (or it may also be x R -x L ) } . 
Transfer function calculating means 158 implements a cross- 
spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 

15 producing means 152 and signals e L and e R outputted from 

subtracters 148 and 150 and, based on this cross -spectrum 
calculation, performs setting and updating of filter 
characteristics of filter means 140-1 to 140-4. Operations 
thereof are the same as those described with respect to the 

20 structure of Fig. 12, 13 or 14. Sum/difference signal 

producing means 172 performs addition, using an adder 173, of 
the signals e L and e R outputted from the subtracters 14 8 and 
150 so as to produce a sum signal e M (= e L +e R ) , while performs 
subtraction thereof using a subtracter 175 so as to produce a 

25 difference signal e s {= e L -e R (or it may also be e R -e L ) } , then 
sends them toward the spot on the counterpart side . 
Operations of the other portions are the same as those 
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described with respect to the structure of Fig. 12, 13 or 14. 
[0102] 

Fig. 16 shows another structural example in the 
stereo echo canceller 16, 24. A sum signal x M (= x L +x R ) and a 
5 difference signal x s {= x L -x R (or it may also be x R -x L ) } 
transmitted from the spot on the counterpart side and 
inputted into line input ends LI (L) and LI (R) are inputted 
into stereo audio signal demodulating means 262. The stereo 
audio signal demodulating means 262 performs addition of the 

10 sum and difference signals x M and x s using an adder 264, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 266 to recover the original signal x L , while 
performs subtraction of the sum and difference signals x M and 
x s using a subtracter 268, and further, gives thereto a 

15 coefficient 1/2 using a coefficient multiplier 270 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends SO(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 
[0103] 

20 Collected audio signals y L and y R of microphones 

MC(L) and MC(R) are inputted into sum/difference signal 
producing means 272. The sum/difference signal producing 
means 272 implements addition of the microphone collected 
audio signals y L and y R using an adder 2 73 so as to produce a 

25 sum signal y M , while implements subtraction thereof using a 
subtracter 275 so as to produce a difference signal y s . 
[0104] 
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Filer means 240-1 to 240-4 are formed by, for 
example, FIR filters. These filter means 240-1 to 240-4 are 
each set with an impulse response corresponding to a 
composite transfer function in the form of combination of 
transfer functions of suitable two systems among transfer 
functions H L l/ H LR/ H rl and Hrr of four audio transfer systems 
between the loudspeakers SP(L) and SP(R) and microphones 
MC(L) and MC(R), respectively, and perform a convolution 
calculation of the left/right two-channel stereo signals x L 
and x R using such impulse responses, thereby producing echo 
cancel signals EC1 to EC4 , respectively. 

[0105] 

An adder 244 performs a calculation of EC1+EC3 . An 
adder 246 performs a calculation of EC2+EC4 . A subtracter 
248 subtracts an echo cancel signal EC1+EC3 from the sum 
signal y M , thereby to perform echo cancellation. A 
subtracter 250 subtracts an echo cancel signal EC2+EC4 from 
the difference signal y s , thereby to perform echo 
cancellation. Signals e M and e s outputted from the 
subtracters 248 and 250 are outputted from line output ends 
LO(L) and LO(R), respectively, and transmitted toward the 
spot on the counterpart side. 
[0106] 

Transfer function calculating means 258 implements a 
cross -spectrum calculation between the sum signal x M and the 
difference signal x s inputted from the line input ends LI (L) 
and LI (R) and the signals e M and e s outputted from the 
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subtracters 248 and 250 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
characteristics (impulse responses) of the filter means 240-1 
to 240-4. Specifically, upon starting the system, the filter 
5 characteristics of the filter means 240-1 to 240-4 are not 

set, i.e. coefficients are all set to zero, so that the echo 
cancel signals EC1 to EC4 are zero, and thus the sum signal 
y M and the difference signal y s outputted from the 
sum/difference signal producing means 272, as they are, are 

10 outputted from the subtracters 248 and 250. Therefore, at 
this time, the transfer function calculating means 258 
performs the cross -spectrum calculation between the sum 
signal x M and the difference signal x s inputted from the line 
input ends LI (L) and LI (R) and the sum signal y M and the 

15 difference signal y s outputted from the subtracters 248 and 
250 and, based on this cross-spectrum calculation, derives a 
plurality of composite transfer functions each in the form of 
combination of transfer functions of suitable two systems 
among transfer functions H LL , H LR , H RL and Hrr of four audio 

20 transfer systems between the loudspeakers SP(L) and SP(R) and 
the microphones MC(L) and MC (R) , respectively, and implements 
initial setting of the filter characteristics of the filter 
means 240-1 to 240-4 to values corresponding to such 
composite transfer functions. After the initial setting, 

25 since the echo cancel signals are produced by the filter 

means 240-1 to 240-4, the echo cancel error signals e M and e s 
corresponding to difference signals between the sum signal y M 
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and the difference signal y s outputted from the 

sum/difference signal producing means 2 72 and the echo cancel 
signals EC1 to EC4 are outputted from the subtracters 248 and 
250. Therefore, at this time, the transfer function 
5 calculating means 258 performs the cross -spectrum calculation 
between the sum signal x M and the difference signal x s 
inputted from the line input ends LI (L) and LI (R) and the 
echo cancel error signals e M and e s outputted from the 
subtracters 248 and 250 and, based on this cross-spectrum 

10 calculation, derives estimated errors of the foregoing 

composite transfer functions, respectively, and updates the 
filter characteristics of the filter means 240-1 to 240-4 to 
values that cancel such estimated errors, respectively. By 
repeating this updating operation per prescribed time period, 

15 the echo cancel error can be converged to a minimum value . 
Further, even if the transfer functions change due to 
movement of the microphone positions or the like, the echo 
cancel error can be converged to a minimum value by 
sequentially updating the filter characteristics of the 

20 filter means 240-1 to 240-4 depending thereon. 
[0107] 

Correlation detecting means 260 detects a 
correlation between the sum signal x M and the difference 
signal x s based on a correlation value calculation or the 
25 like, and stops updating of the foregoing filter 

characteristics when the correlation value is no less than a 
prescribed value. When the correlation value becomes lower 
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than the prescribed value, updating of the foregoing filter 
characteristics is restarted. 
[0108] 

Herein, the filter characteristics (impulse 
responses) that are set to the filter means 240-1 to 240-4 by 
the transfer function calculating means 258 will be described. 
In the transfer function calculating means 258, the following 
calculation is performed. 
(In case of Fixed Type Operation) 

The sum and difference signals y M and y s of the 
output signals y L and y R of the microphones MC(L) and MC(R) , 
assuming that frequency- axis expressions of y M and y s are 
respectively given as Y M and Y S/ become 

Y M = Y L + Y R 

= (X L -H LL + Xr-Hrl) + (Xl-Hlr + Xr-Hrr) 
= X L (H LL + H LR ) + X R (H RL + Hrr) ... (49) 

Ys = Y L - Y R 

= (X L -H LL + Xr'Hrl) - (X L "H LR + Xr-Hrr) 
= Xl ( Hll ~ Hlr ) + Xr ( Hrl - Hrr ) ...(50). 

When the composite transfer functions are given as 
Hlm — Hll + Hlr 

Hrm = Hrl + Hrr 

H L s = H L l ~ H LR 

H R s - H RIj - Hrr 

then 

X L = (X M +X s )/2 

X R = (X M -X s )/2 
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hence, the equations (4 9) and (50) respectively become 
Y M = (Xm+Xs)Hlm/2 + (X m -X s )Hrm/2 

= Xm (Hlm+Hrm) /2 + Xs ( Hlm - Hrm ) / 2 
Y s = (X M +X s )H LS /2 + (X„-X s )H RS /2 
5 = X M (H LS +H RS )/2 + X S (H LS -H RS ) /2 

thus 

2Ym = Xm (Hlm+Hrm) + Xs (Hlm -Hrm) • • • (49 1 ) 
2Y S = X„(H LS +H RS ) + X S (H LS -H RS ) ...(50') 

When both sides of the equations (49') and (50') are 
10 multiplied by complex conjugates X M * and X s * of X M and X s and 
ensemble -averaged, 

SX„*-Y„ = SX M *-X M (Hlm+Hrm) + SX„* • X s (Hlm-Hrm) ...(51) 
SX S *-Y M = 2X S *-X M (Hlm+Hrm) + SX S * • X s (Hlm-Hrm) ...(52) 
SX„*-Y S = SX m *-X m (Hls+Hr S ) + SXm*-X s (Hls-H rs ) ...(53) 
15 SX S *-Y S = SX s *-X m (Hls+Hrs) + SX S * • X s (Hl S -H rs ) ...(54) 

are respectively obtained. 
[0109] 

In the equations (51) to (54) , since X M and X s are 
approximately uncorrelated with each other, such a term 
20 having X M * - X S or X S *-X M becomes approximately zero when 
ensemble-averaged. Further, 

X M *-X„ = |X M | 2 

X s *-X s = | X s | 2 

hence, the equations (51) to (54) respectively become 
25 SX M *-Y M = S | X M | 2 (Hlm+Hrm) ...(51') 

2X S *-Y M = S | X s | 2 (Hlm-Hrm) ...(52') 
SX M *-Y S = S |X„| 2 (Hls+Hr S ) ...(53') 
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SX s *-Ys = 2|X S | 2 (H LS -H RS ) ...(54')- 

By transforming the equations (51 1 ) to (54 ! ), the 
following composite transfer functions are respectively 
derived . 



Hlm+Hrm = SXm* ' Ym/ 2 1 Xm 1 2 . - 


. (51") 




Hlm-Hrm = SXs* ' Y M /S | X s | 2 .. 


. (52") 




H LS +H RS = SX M *-Y S /S |X M | 2 .. 


. (53") 




H LS -H RS = SXs*-Y s /S |X S | 2 .. 


. (54") 




From the equations (51") 


to (54"), 




H LM = SX M *-Y M /S I X M 1 2 + SX S * 


•Y M /s|x s | 2 . 


. . (55) 


Hrm = SX M *-Y M /S |X M | 2 - 2X S * 


•Y„/S|X S | 2 . 


. . (56) 


H LS = SX M *-Y s /2 |X M | 2 + SX S * 


•Y S /S|X S | 2 . 


. . (57) 


H RS = SX M *-Y S /S|X M | 2 - SX S * 


•Y S /S|X S | 2 . 


. . (58) 



are respectively derived. 

15 [0110] 

Impulse responses h^, h™, h LS and h RS obtained by 
applying the inverse Fourier transformation to these derived 
composite transfer functions Hlm, Hrm, H ls and H RS are the 
filter characteristics to be set to the filter means 240-1, 

20 240-2, 240-3 and 240-4, respectively. Therefore, the 
transfer function calculating means 258 derives the 
respective composite transfer functions Hlm, Hrm, H L s and H RS 
from the equations (55) to (58) based on the sum signal x M 
and the difference signal x s inputted into the line input 

25 ends LI (L) and LI (R) and the sum signal y M and the difference 
signal y s outputted from the sum/difference signal producing 
means 2 72, derives the impulse responses Iilm / h^, h LS and h RS 
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by applying the inverse Fourier transformation to those 
derived composite transfer functions, sets the derived 
impulse responses to the filter means 240-1, 240-2, 240-3 and 
240-4, respectively, and further, updates the impulse 
5 responses by repeating this calculation per suitably 

determined prescribed time period (e.g. time period of 
performing ensemble averaging) . 
[0111] 

(In case of Adaptive Type Operation) 
10 The signals e M and e s outputted from the subtracters 

24 8 and 250, assuming that frequency- axis expressions of e M 
and e s are respectively given as E M and E s and the filter 
characteristics set to the filter means 240-1, 240-2, 240-3 
and 240-4 are given as H^lm/ H^rm, H^ ls and H^ RS (h^ LM , h^RM/ 
15 h^ L s and h^ RS when expressed in terms of the impulse 
responses) , become 

E M = {L(H LL +H LR ) + Xr^WHrr)} - (Xl'H^lm + X r -H%m) 

= L{ (H LL +H LR ) - H"lm} + X R { (Hrl+Hrr) - H"rm} . . . (59) 
Es = { L ( Hll ~ Hl R ) + Xr(Hrl-Hrr) } - (Xl'H^ls + X R *H^ R s) 
20 = L{ (H LL -H LR ) - H" LS } + X R { (Hrl-Hrr) - H^ RS } . . . (60) . 

When the composite transfer functions are given as 
Hlm = H L l + H LR 

Hrm = Hrl + Hrr 

Hls = Hll ~ H LR 
2 5 H RS = Hrl - Hrr 

then 

X L = (X M +X s )/2 



X R = (X M -X s )/2 

hence, the equations (59) and (60) respectively become 

E M = (Xm+X s ) • (Hlm-H\m)/2 + (X M -X S ) • (Hrm-H^rm) /2 . . . (61) 
E s = (Xm+Xs) • (H LS -H~ LS ) /2 + (X M -X S ) ■ (H RS -H A RS ) /2 . . . (62) . 
When the estimated errors of the composite transfer 

functions are given as 



AHlm 


= Hlm " 


- H^LM 


AHrm 


= Hrm - 


" H rm 


AH LS 


= H L s ■ 


■ H^ls 


AH RS 


- Hrs - 


- H^R S 



the equations (61) and (62) respectively become 
Em = X M (AHlm+AHrm) /2 + X s (AHlm-AHrm) /2 
E s = X M (AH LS +AH RS ) /2 + X s (AH L s-AH R s) /2 

hence 

2 Em = X m (AH L m+AH rm ) + X s (AHlm-AHrm) ...(61') 

2E S = Xm(AH ls +AH rs ) + Xs(AH LS -AH RS ) ... (62') . 

When both sides of the equations (61') and (62') are 
multiplied by complex conjugates X M * and X s * of X M and X s and 
ensemble - averaged , 

2X M *-2E M = SX M *-X M (AHlm+AHrm) + SX M * ■ X s (AHlm- 
AHrm) ...(63) 

SX S *'2E M = SX S *-X M (AHlm+AHrm) + SX S * • X s (AHlm- 
AHrm) ... (64) 

SX M *'2E S = SX m *-X m (AHls+AHrs) + SX„* • X s ( AH LS - 
AHrs) - • - (65) 

2X S *-2E S = SX s *-Xm(AH L s+AHrs) + 2X S * • X s ( AH LS - 
AHrs) ... (66) 
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are respectively obtained. 
[0112] 

In the equations (63) to (66) , since X M and X s are 
approximately uncorrelated with each other, such a term 
5 having X M *-X S or X S *-X M becomes approximately zero when 
ensemble - averaged . Further, 

X„*-X M = |X„| 2 

X s *-X s = | X s | 2 

hence, the equations (63) to (66) respectively become 
10 2X„*-2E M = S | X„ | 2 (AHlm+AHrm) ...(63') 

SX S *-2E M = S | X s | 2 (AHlm-AHrm) ...(64') 
2X M *-2E S = S |X M | 2 (AH L s+AH RS ) ...(65') 
SX S *-2E S = S|X S | 2 (AH LS -AH RS ) ...(66'). 
By transforming the equations (63 * ) to (66 1 ), the 
15 following composite transfer functions are respectively 
derived . 

AHlm+AHrm = SX M *'2E M /S | X M | 2 ...(63") 
AHlm-AHrm = SX S * • 2E M /S | X s | 2 ...(64") 
AH L s+AH RS = SX M *-2E S /S | X„ | 2 ...(65") 

20 AH ls -AHr S = SX S *-2E S /S |X S | 2 ...(66") 

From the equations (63") to (66"), 
AH LM = SX M *-2E„/S |X M | 2 + SX S *-2E M /S |X S | 2 ...(67) 
AHrm = SX M *-2E M /S |X M | 2 - SX S *-2E„/S|X S | 2 ...(68) 
AH LS = SX M *'2E s /2 |X M | 2 + SX S *-2E S /S|X S | 2 ...(69) 

25 AH RS = SX„*-2E S /S | X M | 2 - 2X S * • 2E S /S | X s | 2 ...(70) 

are respectively derived. 
[0113] 



\ 
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Using the estimated errors AHlm, AHrm, AH ls and AH RS 
derived from the equations (67) to (70) , the filter 
characteristics of the filter means 240-1, 240-2, 240-3 and 
240-4 are updated per suitably determined prescribed time 
5 period (e.g. time period of performing ensemble averaging). 

For example, assuming that impulse responses h^, h^, h LS and 
h RS after K-th updating are given as h L M(k), h^k), h LS (k) and 
h RS (k), using impulse responses Ah^, Ah™, Ah LS and Ah RS 
corresponding to the derived estimated errors AHlm, AH^, AH ls 
10 and AH RS , 

hLM(k'+l) = hLM(k) + aAh LM ...(71) 
hRM(k+l) = hRM(k) + aAhRM ...(72) 
h LS (k+l) = h LS (k) + aAh LS ...(73) 
h RS (k+l) = h RS (k) + aAh RS ...(74). 
15 Using these updating equations, (k+l)th impulse 

responses hu^k+l), h RM (k+l), h LS (k+l) and h RS (k+l) are derived 
and set to the filter means 240-1, 240-2, 240-3 and 240-4, 
respectively, which is repeated per suitably determined 
prescribed time period (e.g. time period of performing 
2 0 ensemble averaging) . 

[0114] 

Fig. 17 shows another structural example in the 
stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 
25 structure of Fig. 16. Left/right two-channel stereo signals 
x L and x R transmitted from the spot on the counterpart side 
and inputted into line input ends LI (L) and LI (R) are 
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outputted from sound output ends SO(L) and SO(R) as they are 
(i.e. not through sum/difference signal producing means 252), 
and reproduced at loudspeakers SP(L) and SP (R) , respectively. 
The sum/difference signal producing means 252 performs 
5 addition of such stereo signals x L and x R using an adder 254 
so as to produce a sum signal x M (= x L +x R ) , while performs 
subtraction thereof using a subtracter 256 so as to produce a 
difference signal x s {= x L -x R (or it may also be x R -x L ) } . 
[0115] 

10 Transfer function calculating means 258 implements a 

cross -spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
producing means 2 52 and signals e M and e s outputted from 
subtracters 248 and 250 and, based on this cross-spectrum 

15 calculation, performs setting and updating of filter 
characteristics of the. filter means 240-1 to 240-4. 
Operations thereof are the same as those described with 
respect to the structure of Fig. 16. The signals e M and e s 
outputted from the subtracters 24 8 and 250 are inputted into 

20 stereo audio signal demodulating means 282. The stereo audio 
signal demodulating means 282 performs addition of the 
signals e M and e s using an adder 284, and further, gives 
thereto a coefficient 1/2 using a coefficient multiplier 286 
to recover a left-channel signal e L , while performs 

25 subtraction of the signals e M and e s using a subtracter 288, 
and further, gives thereto a coefficient 1/2 using a 
coefficient multiplier 290 to recover a right-channel signal 
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e R . The recovered signals e L and e R are respectively 
outputted from line output ends LO(L) and LO (R) and 
transmitted toward the spot on the counterpart side. 
Operations of the other portions are the same as those 
5 -described with respect to the structure of Fig. 16. 
[0116] 

Fig. 18 shows another structural example in the 
stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 

10 structure of Fig. 16 or 17 . A sum signal x M (= x L +x R ) and a 
difference signal x s {= x L -x R (or it may also be x R -x L ) } 
transmitted from the spot on the counterpart side and 
inputted into line input ends LI (L) and LI (R) are inputted 
into stereo audio signal demodulating means 262. The stereo 

15 audio signal demodulating means 2 62 performs addition of the 
sum and difference signals x M and x s using an adder 264, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 266 to recover the original signal x L , while 
performs subtraction of the sum and difference signals x M and 

20 x s using a subtracter 268, and further, gives thereto a 

coefficient 1/2 using a coefficient multiplier 270 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends SO(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 

25 Sum/difference signal producing means 252 performs addition 
of such stereo signals x L and x R using an adder 254 so as to 
produce a sum signal x M (= x L +x R ) , while performs subtraction 
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thereof using a subtracter 256 so as to produce a difference 
signal x s {= x L -x R (or it may also be x R -x L ) } . 
[0117] 

Transfer function calculating means 258 implements a 
5 cross -spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
producing means 2 52 and signals e M and e s output ted from 
subtracters 248 and 250 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 

10 characteristics of filter means 240-1 to 240-4. Operations 
thereof are the same as those described with respect to the 
structure of Fig. 16 or 17. The signals e M and e s outputted 
from the subtracters 248 and 250 are respectively outputted 
from line output ends LO(L) and LO(R) and transmitted toward 

15 the spot on the counterpart side. Operations of the other 

portions are the same as those described with respect to the 
structure of Fig. 16 or 17. 
[0118] 

Fig. 19 shows another structural example in the 
20 stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 
structure of Fig. 16, 17 or 18. Left/right two-channel 
stereo signals x L and x R transmitted from the spot on the 
counterpart side and inputted into line input ends LI (L) and 
25 LI (R) are inputted into sum/difference signal producing means 
252. The sum/difference signal producing means 252 performs 
addition of the stereo signals x L and x R using an adder 254 
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so as to produce a sum signal x M (= x L +x R ) , while performs 
subtraction thereof using a subtracter 256 so as to produce a 
difference signal x s {= x L -x R (or it may also be x R -x L ) } . The 
produced sum signal x M and difference signal x s are inputted 
5 into stereo audio signal demodulating means 262. The stereo 
audio signal demodulating means 2 62 performs addition of the 
sum and difference signals x M and x s using an adder 264, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 266 to recover the original signal x L , while 

10 performs subtraction of the sum and difference signals x M and 
x s using a subtracter 268, and further, gives thereto a 
coefficient 1/2 using a coefficient multiplier 270 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends SO(L) and SO(R) and 

15 reproduced at loudspeakers SP(L) and SP (R) , respectively. 
[0119] 

Transfer function calculating means 258 implements a 
cross -spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 

20 producing means 252 and signals e M and e s outputted from 

subtracters 248 and 250 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
characteristics of the filter means 240-1 to 240-4. 
Operations thereof are the same as those described with 

25 respect to the structure of Fig. 16, 17 or 18. The signals 
e M and e s outputted from the subtracters 248 and 250 are 
inputted into stereo audio signal demodulating means 282. 
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The stereo audio signal demodulating means 282 performs 
addition of the signals e M and e s using an adder 284, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 286 to recover a left-channel signal e L/ while 
5 performs subtraction of the signals e M and e s using a 

subtracter 288, and further, gives thereto a coefficient 1/2 
using a coefficient multiplier 290 to recover a right-channel 
signal e R . The recovered signals e L and e R are respectively 
outputted from line output ends LO (L) and LO(R) and 
10 transmitted toward the spot on the counterpart side. 

Operations of the other portions are the same as those 
described with respect to the structure of Fig. 16, 17 or 18. 
[0120] 

Fig. 2 0 shows another structural example in the 
15 stereo echo canceller 16, 24. A sum signal x M (= x L +x R ) and a 
difference signal x s { = x L -x R (or it may also be x R -x L ) } 
transmitted from the spot on the counterpart side and 
inputted into line input ends LI (L) and LI (R) are inputted 
into stereo audio signal demodulating means 362. The stereo 
2 0 audio signal demodulating means 3 62 performs addition of the 
sum and difference signals x M and x s using an adder 364, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 366 to recover the original signal x L , while 
performs subtraction of the sum and difference signals x M and 
25 x s using a subtracter 368, and further, gives thereto a 

coefficient 1/2 using a coefficient multiplier 370 to recover 
the original signal x R . The recovered signals x L and x R are 
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outputted from sound output ends SO(L) and SO(R) and 
reproduced at loudspeakers SP (L) and SP(R), respectively. 
[0121] 

Collected audio signals y L and y R of microphones 
5 MC(L) and MC(R) are inputted into sum/difference signal 

producing means 372. The sum/difference signal producing 
means 372 implements addition of the microphone collected 
audio signals y L and y R using an adder 3 73 so as to produce a 
sum signal y M , while implements subtraction thereof using a 
10 subtracter 375 so as to produce a difference signal y s . 
[0122] 

Filer means 340-1 to 340-4 are formed by, for 
example, FIR filters. These filter means 340-1 to 340-4 are 
each set with an impulse response corresponding to a 

15 composite transfer function in the form of combination of 

transfer functions H LL/ H LR , Hrl and Hrr of four audio transfer 
systems between the loudspeakers SP(L) and SP(R) and 
microphones MC(L) and MC(R), respectively, and perform, using 
such impulse responses, a convolution calculation of the sum 

20 signal x M and the difference signal x s inputted from the line 
input ends LI (L) and LI (R) , thereby producing echo cancel 
signals EC1 to EC4 , respectively. 
[0123] 

An adder 344 performs a calculation of EC1+EC3 . An 
25 adder 346 performs a calculation of EC2+EC4 . A subtracter 
348 subtracts an echo, cancel signal EC1+EC3 from the sum 
signal y M , thereby to perform echo cancellation. A 
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subtracter 350 subtracts an echo cancel signal EC2+EC4 from 
the difference signal y S/ thereby to perform echo 
cancellation. Signals e M and e s outputted from the 
subtracters 348 and 350 are outputted from line output ends 
5 LO(L) and LO(R), respectively, and transmitted toward the 
spot on the counterpart side. 
[0124] 

Transfer function calculating means 358 implements a 
cross-spectrum calculation between the sum signal x M and the 

10 difference signal x s inputted from the line input ends LI (L) 
and LI (R) and the signals e M and e s outputted from the 
subtracters 348 and 350 and, based on this cross -spectrum 
calculation, performs setting and updating of filter 
characteristics (impulse responses) of the filter means 340-1 

15 to 340-4. Specifically, upon starting the system, the filter 
characteristics of the filter means 340-1 to 340-4 are not 
set, i.e. coefficients are all set to zero, so that the echo 
cancel signals EC1 to EC4 are zero, and thus the sum signal 
y M and the difference signal y s outputted from the 

20 sum/difference signal producing means 372, as they are, are 
outputted from the subtracters 348 and 350. Therefore, at 
this time, the transfer function calculating means 358 
performs the cross -spectrum calculation between the sum 
signal x M and the difference signal x s inputted from the line 

25 input ends LI (L) and LI (R) and the sum signal e M and the 

difference signal e s outputted from the subtracters 348 and 
350 and, based on this cross-spectrum calculation, derives a 
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plurality of composite transfer functions each in the form of 
combination of transfer functions H LL , H LR/ H RL and Hrr of four 
audio transfer systems between the loudspeakers SP(L) and 
SP(R) and the microphones MC(L) and MC(R), respectively, and 
5 implements initial setting of the filter characteristics of 
the filter means 340-1 to 340-4 to values corresponding to 
such composite transfer functions. After the initial setting, 
since the echo cancel signals are produced by the filter 
means 34 0-1 to 34 0-4, the echo cancel error signals e M and e s 
10 corresponding to difference signals between the sum signal y M 
and the difference signal y s outputted from the 

sum/difference signal producing means 3 72 and the echo cancel 
signals EC1 to EC4 are outputted from the subtracters 348 and 
350. Therefore, at this time, the transfer function 

15 calculating means 358 performs the cross-spectrum calculation 
between the sum signal x M and the difference signal x s 
inputted from the line input ends LI (L) and LI (R) and the 
echo cancel error signals e M and e s outputted from the 
subtracters 348 and 350 and, based on this cross-spectrum 

20 calculation, derives estimated errors of the foregoing 

composite transfer functions, respectively, and updates the 
filter characteristics of the filter means 340-1 to 340-4 to 
values that cancel such estimated errors, respectively. By 
repeating this updating operation per prescribed time period, 

25 the echo cancel error can be converged to a minimum value. 
Further, even if the transfer functions change due to 
movement of the microphone positions or the like, the echo 
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cancel error can be converged to a minimum value by 
sequentially updating the filter characteristics of the 
filter means 340-1 to 340-4 depending thereon. 
[0125] 

5 Correlation detecting means 360 detects a 

correlation between the sum signal x M and the difference 
signal x s based on a correlation value calculation or the 
like, and stops updating of the foregoing filter 
characteristics when the correlation value is no less than a 
10 prescribed value. When the correlation value becomes lower 
than the prescribed value, updating of the foregoing filter 
characteristics is restarted. 
[0126] 

Herein, the filter characteristics (impulse 
15 responses) that are set to the filter means 340-1 to 340-4 by 
the transfer function calculating means 358 will be described. 
In the transfer function calculating means 358, the following 
calculation is performed. 
(In case of Fixed Type Operation) 
20 The sum and difference signals y M and y s of the 

output signals y L and y R of the microphones MC(L) and MC(R) 
become 

Y M = Y L + Y R 

= (X L -H LL + Xr-Hrl) + (X L -H LR + Xr-Hrr) 
25 = X L (H LL + H LR ) + X r (Hrl + Hrr) ... (71) 

Y s = Y L - Y R 

= (X L -H LL + X R • H R l ) - (X L -H LR + Xr-Hrr) 
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= X L (H LL - H LR ) + X R (H RL - Hrr) ... (72) . 
X L = (X M +X s )/2 
X R = (X„-X s )/2 

hence, the equations (71) and (72) respectively become 
5 Y M - (X„+X s ) • (H LL +H LR ) /2 + (X M -X S ) • (Hrl+Hrr) /2 

= Xm (Hll+Hlr+Hrl+Hrr) /2 + Xs (Hll+Hlr-Hrl-Hrr) /2 ... (71 1 ) 
Y s = (X„+X s ) • (H LL -H LR )/2 + (X M -X S ) • (Hr^-Hrr) /2 

= Xm (Hll-Hlr+Hrl-Hrr) /2 + Xs (Hll-Hlr-Hrl+Hrr) /2 . . . (72 1 ) 
When the composite transfer functions are given as 
10 Hmm = (Hll+Hlr+Hrl+Hrr) /2 

HsM - (Hll + Hlr-Hrl-Hrr) /2 
HmS - ( H LL " Hlr+Hrl ~ Hrr ) / 2 

Hss — (Hll-Hlr-Hrl+Hrr) /2 
the equations (71 f ) and (72*) respectively become 
15 Y M = X m -H„m + Xs-Hsm ...(71") 

Y s = X M -H M s + X s /H ss '. . . (72") . 

When both sides of the equations (71") and (72") are 

multiplied by complex conjugates X M * and X s * of X M and X s and 

ensemble -averaged, 
20 SX M *-Y M = SX„*-X M -H MM + SX m *-X s -Hsm ...(73) 

£X S *-Y M = SX S *-X M -H MM + 2X S *-X S -H SM ...(74) 
£X M *-Y S = SX M *-X M -H MS + SX M *-X S -H SS ...(75) 
SX S *-Y S = SX s *-X m -Hms + 2X S *-X S -H SS ...(76) 

are respectively obtained. 
25 [0127] 

In the equations (73) to (76) , since X M and X s are 

approximately uncorrelated with each other, such a term 
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having X M *'X S or X S *-X M becomes approximately zero when 
ensemble-averaged . Further, 

X„*-X„ = |X„| 2 

X s *-X s = |X S | 2 

5 hence, the equations (73) to (76) respectively become 

EX M *-Y M = 2|X m | 2 -Hmm ...(73') 

SX S *-Y M = S | Xs | 2 " Hsm . • - (74 ' ) 

SX M *-Ys = 2 |X„| 2 -H MS ...(75M 

£Xs*-Y s = S|X S | 2 -H SS ...(76'). 
10 From the equations (73 ! ) to (76'), 

Hmm = 2X M *-Y M /S |X m | 2 ...(77) 

H SM = SX S *-Y M /S |X S | 2 ...(78) 

H MS = SX M *-Y S /S |X m | 2 . . . (79) 

H ss = SX S *- Y s /S | X s | 2 ...(80) 
15 are respectively derived. 

[0128] 

Impulse responses Iimm, h S M, h M s and h S s obtained by 
applying the inverse Fourier transformation to these derived 
composite transfer functions Hmm, H S m/ H ms and H S s are the 

20 filter characteristics to be set to the filter means 340-1, 
340-2, 340-3 and 340-4, respectively. Therefore, the 
transfer function calculating means 3 58 derives the 
respective composite transfer functions Hmm, H sm , H ms and H ss 
from the equations (77) to (80) based on the sum signal x M 

25 and the difference signal x s inputted into the line input 

ends LI (L) and LI (R) and the sum signal y M and the difference 
signal y s outputted from the sum/difference signal producing 
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means 372, derives the impulse responses h*™, h SM / h MS and h S s 
by applying the inverse Fourier transformation to those 
derived composite transfer functions, sets the derived 
impulse responses to the filter means 340-1, 340-2, 340-3 and 
5 340-4, respectively, and further, updates the impulse 
responses by repeating this calculation per suitably 
determined prescribed time period (e.g. time period of 
performing ensemble averaging) . 
[0129] 

10 (In case of Adaptive Type Operation) 

Assuming that the filter characteristics set to the 
filter means 340-1, 340-2, 340-3 and 340-4 are given as H^mm, 
H^sm, H% s and H^ S s (h^MM/ h^sM, h^ MS and h^ ss when expressed in 
terms of the impulse responses) , the signals e M and e s 
15 outputted from the subtracters 348 and 350 become 

E M = { (X M +X S ) ; (H LL +H LR )/2 + (X M -X S ) ■ (Hrl+Hrr) /2 } 
(Xm'H^mm + Xs " H sm) 
- Xm [ { (Hll+Hlr+Hrl+Hrr) /2 } - H^mm] 

+ X s [{ (Hll+Hlr-Hrl-Hrr) /2} - H" SM ] ...(81) 
20 E s = { (X M +Xs) • (H LL -H LR ) /2 + (X„-X s ) • (H RL -H RR )/2} 

(Xm'H^ms + Xs'H^ss) 
= X M [ { (Hll-Hlr+Hrl-Hrr) /2 } - H^ MS ] 

+ Xs [ { (Hll-Hlr-Hrl+Hrr) /2 } - H^ss] • • • ( 82 ) 
When the composite transfer functions are given as 
25 Hmm = (Hll+Hlr+Hrl+Hrr) /2 

Hsm = (Hll+Hlr-Hrl-Hrr) /2 
Hms = ( H L l ~ H LR +H RL - Hrr ) / 2 
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H ss = (Hll-Hlr-Hrl+Hrr) /2 
the equations (81) and (82) respectively become 
Em = Xm ( Hmm ~ H mm ) + Xs ( Hsm ~ H sm) . . . ( 8 1 1 ) 
Es — Xm (Hms~H ms) + Xs (Hss~H ss) ... (82 1 ) - 
When 

AHmm = Hmm ~ H^mm 
AH S m — H S m - H^sm 
AHms - Hms " H^ms 
AHss = Hss ~ H ss 

are given, the equations (81 f ) and (82') respectively become 
E M = Xm'AHmm + Xs'AHsm ...(81") 
E s = Xm'AHms + Xs-AHss . . - (82») . 

When both sides of the equations (81 n ) and (82") are 
multiplied by complex conjugates X M * and X s * of X M and X s and 
ensemble - averaged , 

EX M *-E M = EX m *-Xm-AHmm + EX m *-X s -AHsm ...(83) 
2X S *-E M = SX s *-X m -AHmm + SX s *-X s -AH S m ...(84) 
SX M *'E S = 2Xm*-X m -AH ms + 2X M *-X s -AHss ...(85) 
SX s *-Es = SX s *-Xm-AH M s + SX s *-X s -AHss ...(86) 
are respectively obtained. 
[0130] 

In the equations (83) to (86) , since X M and X s are 
approximately uncorrelated with each other, such a term 
having X M *-X S or X S *-X M becomes approximately zero when 
ensemble - averaged . Further, 

x M *-x M = |X M | 2 

Xs*-X s = | X s | 2 
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hence, the equations (83) to (86) respectively become 

£X M *-E M = S|X m | 2 -AHmm ...(83') 

2X S *-E M = S|X S | 2 -AH SM ...(84') 

2X M *-E S = S|X M | 2 -AH MS ...(85') 
5 SX S *-E S = S|X S | 2 -AH SS ...(86'). 

From the equations (83') to (86 f ), 

AHmm = SX M *-E M /S |X M | 2 ...(87) 

AH SM = SX s *-E M /£ |X S | 2 ...(88) 

AH MS = £X M *-E s /2 |X m | 2 ...(89) 
10 AH SS = SX S *-E S /S |X S | 2 ...(90) 

are respectively derived. 

[0131] 

Using the estimated errors AHmm/ AH SM/ AH ms and AH SS 
derived from the equations (87) to (90) , the filter 

15 characteristics of the filter means 340-1, 340-2, 340-3 and 
340-4 are updated per suitably determined prescribed time 
period (e.g. time period of performing ensemble averaging). 
For example, assuming that impulse responses h^, h S M# h MS and 
h ss after K-th updating are given as h^k), h SM (k), h MS (k) and 

20 hss(k), using impulse responses AhMM/ Ah S M/ Ah MS and Ah ss 

corresponding to the derived estimated errors AHmm/ AH S m/ AH ms 
and AH S s/ 

hMM(k+l) = hMw(k) + aAhwM ...(91) 
h SM (k+l) = h SM (k) + aAhsM ...(92) 
25 h MS (k+l) = h MS (k) + aAh MS ...(93) 

h ss (k+l) = h ss (k) + aAh ss ...(94). 

Using these updating equations, (k+l)th impulse 
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responses hrayidc+l), h SM (k+l), h MS (k+l) and h ss (k+D are derived 
and set to the filter means 340-1, 340-2, 340-3 and 340-4, 
respectively, which is repeated per suitably determined 
prescribed time period (e.g. time period of performing 
5 ensemble averaging) . 

[0132] 

Fig. 21 shows another structural example in the 
stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 

10 structure of Fig. 20. Left/right two-channel stereo signals 
x L and x R transmitted from the spot on the counterpart side 
and inputted into line input ends LI (L) and LI (R) are 
outputted from sound output ends S0(L) and SO(R) as they are 
(i.e. not through sum/difference signal producing means 352), 

15 and reproduced at loudspeakers SP(L) and SP(R), respectively. 
The sum/difference signal producing means 352 performs 
addition of such stereo signals x L and x R using an adder 354 
so as to produce a sum signal x M (= x L +x R ) , while performs 
subtraction thereof using a subtracter 356 so as to produce a 

20 difference signal x s {= x L -x R (or it may also be x R -x L ) } . 
[0133] 

Transfer function calculating means 358 implements a 
cross- spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
2 5 producing means 3 52 and signals e M and e s outputted from 

subtracters 348 and 350 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
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characteristics of the filter means 340-1 to 340-4. 
Operations thereof are the same as those described with 
respect to the structure of Fig. 20. The signals e M and e s 
outputted from the subtracters 348 and 350 are inputted into 
5 stereo audio signal demodulating means 382. The stereo audio 
signal demodulating means 382 performs addition of the 
signals e M and e s using an adder 384, and further, gives 
thereto a coefficient 1/2 using a coefficient multiplier 386 
to recover a left-channel signal e L , while performs 

10 subtraction of the signals e M and e s using a subtracter 388, 
and further, gives thereto a coefficient 1/2 using a 
coefficient multiplier 390 to recover a right-channel signal 
e R . The recovered signals e L and e R are respectively 
outputted from line output ends LO(L) and L0(R) and 

15 transmitted toward the spot on the counterpart side. 

Operations of the other portions are the same as those 
described with respect to the structure of Fig. 20. 
[0134] 

Fig. 22 shows another structural example in the 
20 stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 
structure of Fig. 20 or 21. A sum signal x M (= x L +x R ) and a 
difference signal x s {= x L -x R (or it may also be x R -x L ) } 
transmitted from the spot on the counterpart side and 
25 inputted into line input ends LI (L) and LI (R) are inputted 

into stereo audio signal demodulating means 362. The stereo 
audio signal demodulating means 3 62 performs addition of the 
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sum and difference signals x M and x s using an adder 364, and 
further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 366 to recover the original signal x L/ while 
performs subtraction of the sum and difference signals x M and 
5 x s using a subtracter 368, and further, gives thereto a 

coefficient 1/2 using a coefficient multiplier 370 to recover 
the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends S0(L) and SO(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 

10 Sum/difference signal producing means 3 52 performs addition 
of such stereo signals x L and x R using an adder 354 so as to 
produce a sum signal x M (= x L +x R ) , while performs subtraction 
thereof using a subtracter 356 so as to produce a difference 
signal x s {= x L -x R (or it may also be x R -x L ) } . 

15 [0135] 

Transfer function calculating means 358 implements a 
cross -spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
producing means 3 52 and signals e M and e s outputted from 

20 subtracters 348 and 350 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
characteristics of filter means 340-1 to 340-4. Operations 
thereof are the same as those described with respect to the 
structure of Fig. 20 or 21. The signals e M and e s outputted 

25 from the subtracters 348 and 350 are respectively outputted 
from line output ends LO(L) and LO(R) and transmitted toward 
the spot on the counterpart side. Operations of the other 
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portions are the same as those described with respect to the 
structure of Fig. 20 or 21. 
[0136] 

Fig. 23 shows another structural example in the 
5 stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
used with respect to those portions common to the foregoing 
structure of Fig. 20, 21 or 22. Left/right two-channel 
stereo signals x L and x R transmitted from the spot on the 
counterpart side and inputted into line input ends LI (L) and 

10 LI (R) are inputted into sum/difference signal producing means 
352. The sum/difference signal producing means 352 performs 
addition of the stereo signals x L and x R using an adder 354 
so as to produce a sum signal x M (= x L +x R ) , while performs 
subtraction thereof using a subtracter 356 so as to produce a 

15 difference signal x s {= x L -x R (or it may also be x R -x L ) } . The 
produced sum signal x M and difference signal x s are inputted 
into stereo audio signal demodulating means 362. The stereo 
audio signal demodulating means 362 performs addition of the 
sum and difference signals x M and x s using an adder 3 64, and 

20 further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 366 to recover the original signal x L , while 
performs subtraction of the sum and difference signals x M and 
x s using a subtracter 368, and further, gives thereto a 
coefficient 1/2 using a coefficient multiplier 370 to recover 

2 5 the original signal x R . The recovered signals x L and x R are 
outputted from sound output ends SO(L) and S0(R) and 
reproduced at loudspeakers SP(L) and SP(R), respectively. 
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[0137] 

Transfer function calculating means 358 implements a 
cross-spectrum calculation between the sum signal x M and the 
difference signal x s produced by the sum/difference signal 
5 producing means 3 52 and signals e M and e s outputted from 

subtracters 348 and 350 and, based on this cross-spectrum 
calculation, performs setting and updating of filter 
characteristics of the filter means 340-1 to 340-4. 
Operations thereof are the same as those described with 

10 respect to the structure of Fig. 20 , 21 or 22. The signals 
e M and e s outputted from the subtracters 348 and 350 are 
inputted into stereo audio signal demodulating means 382. 
The stereo audio signal demodulating means 3 82 performs 
addition of the signals e M and e s using an adder 384, and 

15 further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 386 to recover a left-channel signal e L/ while 
performs subtraction of the signals e M and e s using a 
subtracter 388, and further, gives thereto a coefficient 1/2 
using a coefficient multiplier 390 to recover a right-channel 

20 signal e R . The recovered signals e L and e R are respectively 
outputted from line output ends LO(L) and LO (R) and 
transmitted toward the spot on the counterpart side. 
Operations of the other portions are the same as those 
described with respect to the structure of Fig. 20, 21 or 22. 

25 [0138] 

This invention can take various structures in 
addition to the structures as shown in the foregoing 
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embodiments. For example, part of the structures of Figs. 1, 
9 to 11 can be changed like Fig. 24. Specifically, 
sum/difference signal producing means 4 02 is disposed between 
the adders 44 and 4 6 and the subtracters 4 8 and 50, so as to 
5 perform addition of the echo cancel signals EC1+EC3 and 

EC2+EC4 using an adder 4 04 to produce an echo cancel signal 
(EC1+EC3 ) + (EC2+EC4) , while perform subtraction thereof using 
a subtracter 406 to produce (EC1+EC3 ) - (EC2+EC4 ) . Further, 
sum/difference signal producing means 408 is disposed between 

10 the sound input ends SI (L) and SI (R) and the subtracters 48 
and 50, so as to perform addition of the collected audio 
signals y L and y R of the microphones MC(L) and MC(R) using an 
adder 410 to produce a sum signal y M , while perform 
subtraction thereof using a subtracter 412 to produce a 

15 difference signal y s . The subtracter 48 subtracts the echo 
cancel signal (EC1+EC3 ) + (EC2+EC4 ) from the sum signal y M to 
implement echo cancellation. The subtracter 50 subtracts the 
echo cancel signal (EC1+EC3 ) - (EC2+EC4 ) from the difference 
signal y s to implement echo cancellation. The signals e M and 

2 0 e s outputted from the subtracters 48 and 50 are inputted into 
stereo audio signal demodulating means 414. The stereo audio 
signal demodulating means 414 performs addition of the 
signals e M and e s using an adder 416, and further, gives 
thereto a coefficient 1/2 using a coefficient multiplier 418 

25 to recover a left-channel signal e L , while performs 

subtraction of the signals e M and e s using a subtracter 420, 
and further, gives thereto a coefficient 1/2 using a 
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coefficient multiplier 422 to recover a right-channel signal 
e R . The other portions are the same as those described with 
respect to the structures of Figs. 1, 9 to 11. 
[0139] 

5 On the other hand, part of the structures of Figs. 

12 to 15 can be changed like Fig. 25. Specifically, 
sum/difference signal producing means 432 is disposed between 
the adders 144 and 146 and the subtracters 14 8 and 150, so as 
to perform addition of the echo cancel signals EC1+EC3 and 

10 EC2+EC4 using an adder 434 to produce an echo cancel signal 
(EC1+EC3 ) + (EC2+EC4 ) , while perform subtraction thereof using 
a subtracter 436 to produce (EC1+EC3 ) - (EC2+EC4 ) . Further, 
sum/difference signal producing means 438 is disposed between 
the sound input ends SI (L) and SI (R) and the subtracters 148 

15 and 150, so as to perform addition of the collected audio 

signals y L and y R of the microphones MC(L) and MC(R) using an 
adder 44 0 to produce a sum signal y M , while perform 
subtraction thereof using a subtracter 442 to produce a 
difference signal y s - The subtracter 148 subtracts the echo 

20 cancel signal (EC1+EC3 ) + (EC2+EC4 ) from the sum signal y M to 
implement echo cancellation. The subtracter 150 subtracts 
the echo cancel signal (EC1+EC3 ) - (EC2+EC4 ) from the 
difference signal y s to implement echo cancellation. The 
signals e M and e s outputted from the subtracters 14 8 and 150 

25 are inputted into stereo audio signal demodulating means 444. 
The stereo audio signal demodulating means 444 performs 
addition of the signals e M and e s using an adder 446, and 
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• further, gives thereto a coefficient 1/2 using a coefficient 
multiplier 448 to recover a left-channel signal e L/ while 
performs subtraction of the signals e M and e s using a 
subtracter 450, and further, gives thereto a coefficient 1/2 
5 using a coefficient multiplier 452 to recover a right-channel 
signal e R . The other portions are the same as those 
described with respect to the structures of Figs. 12 to 15. 
[0140] 

On the other hand, part of the structures of Figs. 

10 16 to 19 can be changed like Fig. 26. Specifically, 

sum/difference signal producing means 462 is disposed between 
the adders 244 and 246 and the subtracters 248 and 250, so as 
to perform addition of the echo cancel signals EC1+EC3 and 
EC2+EC4 using an adder 464 to produce an echo cancel signal 

15 (EC1+EC3) + (EC2+EC4) , while perform subtraction thereof using 

a subtracter 466 to produce (EC1+EC3 ) - (EC2+EC4 ) . The 
sum/difference signal producing means 272 of Figs. 16 to 19 
is not required. The subtracter 24 8 subtracts the echo 
cancel signal (EC1+EC3 ) + (EC2+EC4) from the collected audio 

20 signal y L of the microphone MC(L) to implement echo 

cancellation. The subtracter 250 subtracts the echo cancel 
signal (EC1+EC3) - (EC2+EC4) from the collected audio signal y R 
of the microphone MC(R) to implement echo cancellation. 
Sum/difference signal producing means 468 is disposed on the 

25 output side of the subtracters 248 and 250, so as to perform 
addition of the output signals e L and e R of the subtracters 
248 and 250 using an adder 470 to produce a sum signal e M , 
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while perform subtraction thereof using a subtracter 4 72 to 
produce a difference signal e s - The other portions are the 
same as those described with respect to the structures of 
Figs. 16 to 19. 
5 [0141] 

On the other hand, part of the structures of Figs. 
20 and 21 can be changed like Fig. 27. Specifically, 
sum/difference signal producing means 482 is disposed between 
the adders 344 and 346 and the subtracters 348 and 350, so as 

10 to perform addition of the echo cancel signals EC1+EC3 and 
EC2+EC4 using an adder 484 .to produce an echo cancel signal 
(EC1+EC3) + (EC2+EC4) , while perform subtraction thereof using 
a subtracter 486 to produce (EC1+EC3) - (EC2+EC4) . The 
sum/difference signal producing means 372 of Figs. 20 to 23 

15 is not required. The subtracter 348 subtracts the echo 

cancel signal (EC1+EC3) + (EC2+EC4) from the collected audio 
signal y L of the microphone MC(L) to implement echo 
cancellation. The subtracter 350 subtracts the echo cancel 
signal (EC1+EC3) - (EC2+EC4) from the collected audio signal y R 

20 of the microphone MC(R) to implement echo cancellation. 

Sum/difference signal producing means 4 88 is disposed on the 
output side of the subtracters 348 and 350, so as to perform 
addition of the output signals e L and e R of the subtracters 
348 and 350 using an adder 490 to produce a sum signal e M , 

25 while perform subtraction thereof using a subtracter 4 92 to 
produce a difference signal e s . The other portions are the 
same as those described with respect to the structures of 
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Figs. 20 to 23. 

[0142] 

Fig. 2 8 shows another structural example in the 
stereo echo canceller 16, 24 of Fig. 2. The same symbols are 
5 used with respect to those portions common to the foregoing 
structure of Fig. 1. Left/right two-channel stereo signals 
x L and x R transmitted from the spot on the counterpart side 
and inputted into line input ends LI (L) and LI (R) are 
outputted from sound output ends SO(L) and SO(R) as they are 
10 (i.e. not through an orthogonalizing filter 500), and 

reproduced at loudspeakers SP (L) and SP (R) , respectively. 
[0143] 

Filter means 40-1 is set with an impulse response 
corresponding to a transfer function between the loudspeaker 

15 SP(L) and a microphone MC(L) / filter means 40-2 is set with 
an impulse response corresponding to a transfer function 
between the loudspeaker SP(L) and a microphone MC (R) , filter 
means 4 0-3 is set with an impulse response corresponding to a 
transfer function between the loudspeaker SP(R) and the 

20 microphone MC(L), and filter means 40-4 is set with an 

impulse response corresponding to a transfer function between 
the loudspeaker SP(R) and the microphone MC(R) . 
[0144] 

The orthogonalizing filter 500 performs a principal 
25 component analysis with respect to the input stereo signals 
x L and x R per prescribed time period, and converts such input 
stereo signals x L and x R into two signals that are orthogonal 
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to each other. Transfer function calculating means 502 
implements a cross-spectrum calculation between the mutually 
orthogonal two signals produced at the orthogonal i zing filter 
500 and signals e L and e R outputted from subtracters 48 and 
5 50 and, based on this cross-spectrum calculation, sets filter 
characteristics (impulse responses) of the filter means 40-1 
to 40-4. Specifically, upon starting the system, the filter 
characteristics of the filter means 40-1 to 40-4 are not set, 
i.e. coefficients are all set to zero, so that echo cancel 

10 signals EC1 to EC4 are zero, and thus collected audio signals 
of the microphones MC(L) and MC(R) themselves are outputted 
from the subtracters 48 and 50. Therefore, at this time, the 
transfer function calculating means 502 performs the cross- 
spectrum calculation between the mutually orthogonal two 

15 signals produced at the orthogonalizing filter 500 and the 
collected audio signals e L and e R of the microphones MC(L) 
and MC(R) outputted from the subtracters 48 and 50 and, based 
on this cross -spectrum calculation, derives transfer 
functions of four audio transfer systems between the 

20 loudspeakers SP(L) and SP (R) and the microphones MC(L) and 
MC(R), respectively, and implements initial setting of the 
filter characteristics of the filter means 40-1 to 40-4 to 
values corresponding to such transfer functions. After the 
initial setting, since the echo cancel signals are produced 

25 by the filter means 40-1 to 40-4, the echo cancel error 

signals e L and e R corresponding to difference signals between 
the collected audio signals of the microphones MC(L) and 
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MC(R) and the echo cancel signals EC1 to EC4 are outputted 
from the subtracters 4 8 and 50. Therefore, at this time, the 
transfer function calculating means 502 performs the cross- 
spectrum calculation between the mutually orthogonal two 
5 signals produced at the orthogonal i zing filter 500 and the 
echo cancel error signals e L and e R outputted from the 
subtracters 48 and 50 and, based on this cross-spectrum 
calculation, derives estimated errors of the transfer 
functions of the four audio transfer systems between the 

10 loudspeakers SP(L) and SP(R) and the microphones MC(L) and 
MC(R), respectively, and updates the filter characteristics 
of the filter means 40-1 to 40-4 to values that cancel the 
estimated errors, respectively. By repeating this updating 
operation per prescribed time period, the echo cancel error 

15 can be converged to a minimum value. Further, even if the 
transfer functions change due to movement of the microphone 
positions or the like, the echo cancel error can be converged 
to a minimum value by sequentially updating the filter 
characteristics of the filter means 40-1 to 40-4 depending 

2 0 thereon. 

[0145] 

According to the known technique based on comparison 
between the left/right two-channel stereo signals x L and x R 
inputted into the line input ends LI (L) and LI (R) and the 
25 signals e L and e R to be outputted from line output ends LO(L) 
and L0(R), double talk detecting means 504 detects the double 
talk where sounds other than those reproduced by the 
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loudspeakers SP(L) and SP(R) are inputted into the 
microphones MC(L) and MC(R). The transfer function 
calculating means 502 makes relatively longer an update 
period of the filter characteristics of the filter means 40-1 
5 to 40-4 while the double talk is detected, whereas makes 
relatively shorter the update period of the filter 
characteristics while the double talk is not detected. This 
makes it possible to fully converge estimated errors when the 
double talk exists, and further, quicken convergence of 
10 estimated errors when there is no double talk. 
[0146] 

An orthogonalizat ion process of the orthogonal i zing 
filter 500 will be described. The orthogonalization process 
is implemented per prescribed time period of the input stereo 

15 signals. Herein, the orthogonalization process is carried 

out per frame (e.g. 512 samples) as shown in Fig. 29 (however, 
overlap processing may be implemented as shown in Fig. 4 9 
which will be described later) . A vector having as its 
elements a sample group of one frame of the left-channel 

20 input signal x L inputted into the orthogonal i zing filter 500, 
and a vector having as its elements a sample group of one 
frame of the right -channel input signal x R are given as 
[Equation 1] 

25 X L = (X LX , X L2 > X L3 5* * *j X Ln ) 

X R = ( X R\ 9 X R2 9 X R3 9' ' *5 X Rn ) 
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Since x L and x R are the stereo signals, they mutually 
have a correlation. The orthogonalization process is carried 
out by performing a principal component analysis, using x L 
and x R as two variables, with respect to a sample group 
5 composed of combinations of such two variables per frame so 
as to derive eigenvectors of the first principal component 
and the second principal component that are mutually 
orthogonal, and projecting the respective samples composed of 
combinations of such two variables onto the derived 
10 eigenvectors of the first principal component and the second 
principal component, respectively. 
[0147] 

Concrete contents of calculation of the 
orthogonalization process will be described. Now, assuming 
15 that an observation matrix B is given as 
[Equation 2] 

g _ <X Ll > X L2> X L3 »" * '» X Ln ^ 
^ X R\ •> X R2 9 X R3 »' ' X Rn J 

2 0 then, a covariance matrix S of B becomes 
[Equation 3] 
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S = BB T (B T is a transposed matrix of B) 
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5 Hence, 
from 

[Equation 4] 



10 



S u —A S n 
S 2l S 22 -Xj 



= 0 



15 



[Equation 5] 

(Sn-A) (S 22 -A) 
is solved, 
thus 

[Equation 6] 



- Si 2 S 2 i = 0 



X = 



S u +S 22 ±^l(S n +S 22 ) 2 -4(S u S 22 -S n 2 ) 
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so that two solutions for eigenvalues A are derived. 
[0148] 

Assuming that one of the two eigenvalues having 
greater variance (eigenvalue of the first principal 
component) is X lf an eigenvector U max corresponding to the 
eigenvalue Xi is 
[Equation 8] 



which establishes 
[Equation 7] 



$11 ^12 



where u u 2 +u X2 =1 



By solving u xl and u 12 , 
[Equation 9] 



u„ =±- 



12 ~ / 2 2 

(double signs in same order) 



are derived. Regardless of the sign of Uu and u i2 being pi 
or minus, the axis represented by the first principal 



component is the same. 
[0149] 

On the other hand, assuming that one of the two 
eigenvalues having smaller variance (eigenvalue of the second 
principal component) is A 2/ an eigenvector U min corresponding 
to the eigenvalue A 2 is 
[Equation 11] 




which establishes 



[Equation 10] 




where u 2l +w 22 =1 



By solving u 2 i and u 22/ 



[Equation 12] 




St 



12 



= ± 



^2 



11 




(double signs in same order) 



are derived. Regardless of the sign of u 2i and u 22 being plus 
or minus, the axis represented by the second principal 
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component is the same. 

From the foregoing, the eigenvector U max of the first 
principal component and the eigenvector U m i n of the second 
principal component are derived as follows. 
5 [Equation 13] 





(«iA — • 


(u \ 

M 21 








V u n) 





[0150] 

10 With respect to the eigenvectors U ma x and U min 

obtained from the covariance matrix, it is not possible to 
predict in which quadrant it appears due to its nature. If 
the quadrant in which the eigenvector U max appears changes per 
frame, U max 's mutually cancel themselves upon ensemble- 

15 averaging Umax's per block later. If the quadrant in which 

the eigenvector U min appears changes per frame, U m i n ! s mutually 
cancel themselves upon ensemble-averaging U m i n ' s per block 
later. Therefore, a conversion operation is performed for 
fixing the quadrants in which the eigenvectors U ma x and U min 

2 0 appear. For example, when fixing the eigenvector U max to the 

first quadrant and the eigenvector U min to the fourth quadrant, 
it can be realized by the following conversion operation. 
Of the eigenvectors U max and U min , 

• one existing in the first quadrant (positive, 

25 positive) or the third quadrant (negative, negative) is given 
as U m ax 1 / and 

• one existing in the second quadrant (negative, 
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positive) or the fourth quadrant (positive, negative) is 
given as U m ± n 1 . Then, conversion is executed such that 

• when Umax 1 exists in the first quadrant, U ma x=U max l 

• when Umax 1 exists in the third quadrant, U ma x=-Umax f 
5 • when U min f exists in the second quadrant, U min =-U m in ! 

■ when Umin 1 exists in the fourth quadrant, U m in=U min l . 
Through such a conversion operation, the quadrants 
in which the eigenvectors U max and U min appear can be fixed. 
[0151] 

10 Onto the eigenvector U max of the first principal 

component and the eigenvector U m i n of the second principal 
component that are obtained as described above, a column 
vector of the observation matrix B given as 
[Equation 14] 



15 



b = 



\ X Rn) 



is projected. A value of an output signal x max obtained by 
projecting the observation matrix B onto the eigenvector U, 
20 is derived as 
[Equation 15] 

^max = ^ # ^max (• represents inner product) 



25 



Further, a value of an output signal x m i n obtained by- 
projecting the observation matrix B onto the eigenvector U m i n 
is derived as 
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[Equation 16] 

=fe«f/ min (• represents inner product) 



5 [0152] 

The transfer function calculation process of the 
transfer function calculating means 502 will be described. 
(In case of Fixed Type Operation) 

A filter characteristic of the orthogonalizing 
10 filter 500 is given as U. The filter characteristic U is a 
characteristic that projects the input signals x L and x R onto 
the mutually uncorrelated signals x max and x min based on the 
principal component analysis, and the following relationship 
is established . 
15 [Equation 17] 



X 



= U 



u= (u n O 

l«21 "22 ) 



(95) 



An inverse characteristic of the filter 
20 characteristic U is given as V. The inverse filter 

characteristic V is a characteristic that restores the 
signals x ma x and x min to the original signals x L and x R/ and 
the following relationship is established. 
[Equation 18] 



25 
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UV = VU=I (I is a unit matrix) 



V = 



V V 21 V 22J 



(96) 



[0153] 

Output signals y L and y R of the microphones MC(L) and 
MC(R) are expressed as 

Y L = H LL -X L + H RIj -X R . . . (97) 

Y R = H LR -X L + Hrr-Xr . . . (98) . 

From the equations (95) and (96) , 
[Equation 19] 



V ll V 12 

v v 2i v 22 y 



(99) 



hence, assuming that frequency-axis expressions of x max and 
x m in are respectively given as X max and X min/ the equations (97) 
and (98) respectively become 

Y L = Hll. ( Vii * X max +Vi2 ' X m i n ) + H RIi (V2i-X m ax+V 2 2-X m in) ..-(97 f ) 
Yr = HLRtVu'Xmax+Viz-X^n) + Hrr ( V 2 i ' X ma x+V 2 2 ' X min ) ...(98 1 ). 

[0154] 

When both sides of the equation (97 1 ) are multiplied 
by complex conjugates X* max and X* min of X max and X min (i.e. 
deriving cross spectra) and ensemble -averaged, because X max 
and X min are mutually orthogonal, expected values of X* max - X min 
and X* min - X max become zero, respectively, so that the 
following two equations are obtained (note: E[ ] represents 
the ensemble average) . 
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10 



15 



E[X* m ax'Y L ] =E [X* ma x-H LL -v 11 -X max 

+ X* max -H RIl -v 21 -X max ] ...(100) 
E [X* rain • Y L ] = E [X* min • H LL • v 12 ' X min 

+ X* min -H R i J -V22-X m i n ] ...(101) 
Similarly, when both sides of the equation (98') are 
multiplied by complex conjugates X* max and X* m i n of X max and X min 
and ensemble -averaged, the following two equations are 
obtained. 

E [X* max • Y R ] = E [X* max • Hlr • v n • X max 

+ X* max -HRR-v 21 -X max ] ...(102) 
E [X* min • Y R ] = E [X* min • Hlr • v 12 ■ X rain 

+ X* min -H RR -v 22 -X min ] . . . (103) 

[0155] 

Here, if a change of the eigenvalues A is small in a 
time period of performing ensemble averaging, 



[Equation 20] 




U = 








^21 


U 2lJ 


V = 


f V " 


V ' 2 l 




^ V 21 


V 22> 



~E[U]=\ 
~E[V]= 



'£[«„] E[u l2 }^ 

^E[u 2l ] E[u 22 ] 

'E[v n ] £[v 12 ]> 

,£[v 21 ] E[v 22 ]j 



•(104) 



20 is established, so that the equations (100) to (103) are 
rewritten as the following equations (100 ') to (103') . 
E[X* max -Y L ] *E[|X max | 2 ] -H LL -E[v n ] 

+ E [ | X max | 2 ] " Hrl • E [ v 21 ] ...(100') 
E[X* min -Y L ] *E[|X min | 2 ] " Hll ■ E [Vi 2 ] 
25 + E [ |X min | 2 ] -Hrl-E [v 22 ] ...(101') 
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E[X* max -Y R ] «E[|X ma x| 2 ] -Hlr-ECvh] 

+ E[ |X ma x| 2 ] -HRR-EtvaJ 
E[X* min -Y R ] «E[|X m in| 2 ] -Hlr-ECv^] 



+ EtlXr, 



! ] 'Hrr-E [v 22 ] 



(102 ' ) 



(103 ' ) 



10 



15 



[0156] 

When both sides of the equations (100 ') and (102') 
are divided by E[|X max | 2 ] and both sides of the equations 
(101') and (103') are divided by E[|X,„i n | 2 ], respectively, 
E[X* max -Y L ]/E[|X max | 2 ] - H LL -E[ Vll ] + Hrl-E[v 21 ] 
E[X* min -Y L ]/E[|X min | 2 ] * H LL -E[v 12 ] + Hrl " E [v 22 ] 
E[X max 1 RJ /E[| ] * H LR -E[vn] + Hrr-E[v 2 i] 

E[X* min -Y R ]/E[|X min | 2 ] * H LR -E[v 12 ] + HRR-Etv^] 

hence 

[Equation 21] 



^ E[X max ' Y L ] E[X min * Y L ] ^ 

E[X max ' Y R ] E\X min * Y R ] 
^[l^max I'] ^min F] . 



H 



TJ \ 

LL RL 



E[v 21 ] E[v 22 ]; 



(105) 



20 



is obtained. From E [U] ■ E [V] *I , the equation (105) is 
rewritten as 
[Equation 22] 



( H LL 

\Hlr HrrJ 



E[X max * Y L ] E[X min * Y L ] 
E[X*max 'Y R ] E[X mm *Y R ] 



K E[u 2X \ E[u 22 \ 



(106) 
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[0157] 

Impulse responses h LL , h RL/ h LR and Iirr obtained by- 
applying the inverse Fourier transformation to the transfer 
functions H LL , Hrl, H lr and Hrr derived from the equation (106) 
are the filter characteristics to be set to the filter means 
40-1, 40-2, 40-3 and 40-4, respectively. Therefore, the 
transfer function calculating means 502 derives the 
respective transfer functions H LL , Hrl, H lr and based on 
the signals x max and x m i n outputted from the orthogonal i zing 
filter 500, the filter characteristic U of the 
orthogonalizing filter 500 and the output signals y L and y R 
of the microphones MC(L) and MC(R), derives the impulse 
responses h LL , h^, h LR and h^ by applying the inverse Fourier 
transformation to those derived transfer functions, sets the 
derived impulse responses to the filter means 40-1, 40-2, 40- 
3 and 40-4, respectively, and further, updates the impulse 
responses by repeating this calculation per suitably 
determined prescribed time period (e.g. time period of 
performing ensemble averaging) . 

[0158] 

(In case of Adaptive Type Operation) 

Assuming that the filter characteristics set to the 
filter means 40-1, 40-2, 40-3 and 40-4 are given as H^ LL , H^kl 
H^ LR and H"rr ( h^LL/ h^ RL , h^ LR and h^^ when expressed in terms 
of the impulse responses) , the signals e L and e R outputted 
from the subtracters 48 and 50 of Fig. 28 are expressed as 

E L = (Hll-Xl+Hrl-Xr) - (H^ll-Xl+H'rl-Xr) ...(107) 
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E R = (H lr -X l +Hrr-X r ) - (H'lr-Xl+H'rr-Xr) . . . (108) . 
From the foregoing equation (99) , the equations 
(107) and (108) respectively become 

El = (Hll"H^ll) ( v ll * X ma x+Vi2 ' X m i n ) 
5 + (Hrl-H"rl) (v 21 -X max +v 22 -X mi n) . . . (107') 

Er = (Hlr-H^ lr ) (Vn'Xmax+Viz-Xmin) 

+ (Hrr-H"rr) (v 21 -X max +v 22 -X min ) . . . (108 ') . 
[0159] 

When the estimated errors of the transfer functions 
10 are given as 



AH LL 


= H L l ■ 


■ H LL 


AHrl 


= Hrl - 


' H^rl 


AH LR 


= Hlr - 


- H LR 


AHrr 


= Hrr - 


" H^RR 



15 the equations (107 1 ) and (108 1 ) respectively become 

E L = AH LL (Vii-X max +Vi 2 -X m in) 

+ AH R L(v 21 -X ma x+v 22 -X min ) ... (107") 
E R = AH L R(v 11 -X max +v 12 -X min ) 

+ AH RR (v 21 -X max +v 22 -X min ) ... (108") . 
20 [0160] 

When both sides of the equation (107") are 
multiplied by complex conjugates X* max and X* min of X ma x and X m i n 
(i.e. deriving cross spectra) and ensemble-averaged, because 
X max and X min are mutually orthogonal, expected values of X* max * 
25 X min and X* m i n * X max become zero, respectively, so that the 

following two equations are obtained (note: E[ ] represents 
the ensemble average) . 
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E [X* ma x'E L ] =E [X* max ' AHll-Vh-X^x 

+ X* max -AH RL -v 21 -X max ] . . . (109) 
E [X* min • El] = E [X* min • AH LL • v 12 • X min 

+ X* mi n-AH RL -v 22 -X min ] . . . (110) 
5 Similarly, when both sides of the equation (108") 

are multiplied by complex conjugates X* max and X* min of X ra ax and 
Xmin and ensemble -averaged, the following two equations are 
obtained . 

E[X* max -E R ] =E[X* max -AH LR -V 11 -X max 
10 + X* max -AH RR -v 21 -X max ] ...(111) 

E [X* min • E R ] = E [X* min • AH LR • v 12 • X min 

+ X* min -AH RR -v 22 -X min ] ...(112)- 

[0161] 

Here, if a change of the eigenvalues A is small in a 
15 time period of performing ensemble averaging, the foregoing 
equation (104) is established so that the equations (109) to 
(112) are rewritten as the following equations (109') to 
(112 • ) . 

E[X* max -E L ] ~E[|X max | 2 ] • AH LL -E [ Vll ] 
20 + E [ |X max | 2 ] * AHrl-E [v 21 ] ...(109') 

E[X* min -E L ] *E[|X rain | 2 ] -AH LL -E[v 12 ] 

+ E[|X min | 2 ] -AHrl-ECv^] ...(HO 1 ) 
E[X* max -E R ] *E[|X max | 2 ] -AHlr-EEvh] 

r] • ahrr-e [v 21 ] ...(in 1 ) 

25 E[X* min -E R ] *E[|X min | 2 ] -AH LR -E[v 12 ] 

+ E[ |X min | 2 ] -AHrr-ECv^] ...(112') 

[0162] 
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10 



When both sides of the equations (109 ! ) and (111 1 ) 
are divided by E[|X ma x| 2 ] and both sides of the equations 
(110 1 ) and (112') are divided by E[|X m i n | 2 ]/ respectively, 
E[X* max -E L ]/E[|X max | 2 ] * AH LL -E[v n ] + AH RL -E[v 21 ] 
E[X* min -E L ]/E[|X min | 2 ] * AH LL -E[v 12 ] + AHrl • E [V22] 
E[X* max -E R ]/E[|X rnax | 2 ] - AH LR -E[v n ] + AHrr • E [V21] 
E[X* min -E R ]/E[|X min | 2 ] * AH LR -E[v 12 ] + AHrr-E^] 

hence 

[Equation 23] 



E[JC max * E L ] E\X min * E L ] 

E[\X mm | 2 ] ~EUX~JY 

E[X max 'E R ] E[JC min * E R ] 



AH^ AH 



RL 



E[v n ] E[v n ] 
{E[v 2l ] E[v 22 ]) 



•(113) 



15 



is obtained. From E [U] • E [V] *I , the equation (113) is 
rewritten as 
[Equation 24] 



E[X max ' E L ] E[X min " E L ] ^ 

"eO^jT E[\X min \ 2 ] 

E\X max " E R ] E\X min * E R ] 



[E[u n ] E[u l2 f 
E[u 21 ] E[u 22 \ 



(114) 



[0163] 

20 Using the estimated errors AH LIj/ AHrl, AH lr and AHrr 

derived from the equation (114) , the filter characteristics 
of the filter means 40-1, 40-2, 40-3 and 40-4 are updated per 
suitably determined prescribed time period (e.g. time period 
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of performing ensemble averaging) . For example, assuming 
that impulse responses h LL/ h RL/ h L R and h^ after K-th 
updating are given as h LL (k), h RL (k), h LR (k) and h^Ck), using 
impulse responses Ah LL/ Ah RL/ Ah LR and Ah^ corresponding to 
5 the derived estimated errors AH LL/ AH RL , AH LR and AH^, 

h L L(k+l) = h LL (k) + aAh LL 

hRL(k+l) = hRL(k) + aAh RL 

h L R(k+l) = h LR (k) + aAh LR 

hRR(k+l) = hRR(k) + aAhRR. 
10 Using these updating equations, (k+l)th impulse 

responses h LL (k+l), h RL (k+l), h LR (k+l) and h^Ck+l) are derived, 
and set to the filter means 40-1, 40-2, 40-3 and 40-4, 
respectively, which is repeated per suitably determined 
prescribed time period (e.g. time period of performing 
15 ensemble averaging) . 

[0164] 

Fig. 3 0 shows functional blocks of the 
orthogonal i zing filter 500 of Fig. 28. The input stereo 
signals x L and x R are inputted from input ends 506 and 508, 

20 respectively. Covariance matrix calculating means 510 

derives a covariance matrix S of the input stereo signals x L 
and x R per frame. Based on the derived covariance matrix S, 
eigenvector calculating means 512 derives coefficients Un, 
Ui2/ u 2 i and u 22 of eigenvectors U max and U min of the first and 

25 second principal components per frame. The derived 

coefficients u llf u 2 i, u i2 and u 22 are set to coefficient 
multipliers 514, 516, 518 and 520, respectively. The 
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coefficient multipliers 514 and 516 give the coefficients Un 
and u 2 i to the input signal x L to derive x L -Un and x L -u 2 i, 
respectively. The coefficient multipliers 518 and 520 give 
the coefficients u i2 and u 22 to the input signal x R to derive 
5 x R -Ui2 and x R -u 22 , respectively. An adder 522 derives 

*l* u X i+Xr- u 12 as a signal x max obtained by projecting the input 
stereo signals x L and x R onto the eigenvector U max . An adder 
524 derives x L • u 21 +x R - u 22 as a signal x min obtained by 
projecting the input stereo signals x L and x R onto the 
10 eigenvector U min . The signals x max and x min are respectively 
outputted from output ends 526 and 528, while the 
coefficients Un, u 2 i, u i2 and u 22 of the eigenvectors are 
outputted from an output end 530. 
[0165] 

15 Fig. 31 shows functional blocks of the transfer 

function calculating means 502 of Fig. 28. The signals x max 
and x m in are respectively inputted from input ends 532 and 534 , 
while the coefficients u U/ u 21/ u X2 and u 22 of the 
eigenvectors are inputted from an input end 536. FFT means 

20 538 applies the fast Fourier transformation to the signal x max . 
Complex conjugate calculating means 540 calculates a complex 
conjugate X* max of X max . Power spectrum calculating means 542 
calculates X max • X* max = | X max | 2 . Ensemble averaging means 544 
calculates E[|X max | 2 ]. FFT means 546 applies the fast Fourier 

25 transformation to the signal x min . Complex conjugate 

calculating means 548 calculates a complex conjugate X* min of 
Xmin- Power spectrum calculating means 550 calculates 
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X m irr X* min = | X min | 2 . Ensemble averaging means 552 calculates 
E[|X min | 2 ] . 

[0166] 

An input end 554 is inputted with an output signal y L 
5 of the microphone MC(L) (or output signal E L of the subtracter 
48) . FFT means 558 applies the fast Fourier transformation 
to the signal y L (or signal E L ) . Cross spectrum calculating 
means 560 derives X* max -Y L (or X* max -E L ), and ensemble 
averaging means 562 derives E[X* max -Y L ] (or E [X* max - E L ] ) . Cross 
10 spectrum calculating means 564 derives X* m i n -Y L (or X* min "E L ), 
and ensemble averaging means 566 derives E[X* min -Y L ] (or 
E[X* min -E L ] ) . 

[0167] 

An input end 568 is inputted with an output signal y R 
15 of the microphone MC(R) (or output signal E R of the subtracter 
50) . FFT means 570 applies the fast Fourier transformation 
to the signal y R (or signal E R ) . Cross spectrum calculating 
means 572 derives X* max -Y R (or X* max -E R ), and ensemble 
averaging means 574 derives E[X* max -Y R ] (or E[X* max -E R ]) . Cross 
20 spectrum calculating means 576 derives X* min -Y R (or X* min -E R ), 
and ensemble averaging means 578 derives E[X* min -Y R ] (or 
E[X* min -E R ] ) . 

[0168] 

Composite transfer function calculating means 580 
25 derives the first term on the right side of the equation 

(106) {or the equation (114)} based on E[|X max | 2 ], E[|X min | 2 ]/ 
E[X* max -Y L ] (or E[X* max -E L ] ) , E[X* min -Y L ] (or E[X* min -E L ] ) , 
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E[X* max -Y R ] (or E[X* max -E R ] ) , and E[X* min -Y R ] (or E[X* min -E R ]) which 
are derived as described above. Averaging means 582 averages 
the coefficients Uu, u 2 i, u X2 and u 22 of the eigenvectors 
individually to derive E [vn] , E [v i2 ] , E [v 21 ] and E [v 22 ] . 
5 [0169] 

Based on the outputs of the composite transfer 
function calculating means 580 and the averaging means 582, 
transfer function calculating means 584 performs a 
calculation of the right side of the equation (106) {or the 

10 equation (114) } to derive individual transfer functions H LL , 
Hul/ H LR and (or their estimated errors AH LL/ AH RL , AH LR and 

AH^) . Inverse FFT means 586 applies the inverse fast 
Fourier transformation to the derived transfer functions H LL , 
Hiu,, H lr and (or their estimated errors AH LL/ AH^, AH LR and 

15 AH^) to derive corresponding impulse responses h LL/ h^, h LR 
and hra (or their estimated errors Ah LL , Ah^, Ah LR and Ah^) , 
and outputs them from output ends 588, 590, 592 and 594, 
respectively. As shown in Fig. 32, the transfer function 
calculating means 584 and the inverse FFT means 586 may be 

20 exchanged therebetween in their positions. 
[0170] 

With respect to the thus structured stereo echo 
canceller 16, 24 of Fig. 28, the results of carrying out 
adaptive type operation simulations are shown in Figs. 33 to 
25 48 in connection with one audio transfer system. Figs. 33 to 
4 0 show time-domain variations in echo cancellation amount, 
while Figs. 41 to 48 show time-domain variations in transfer 



- 205 - 



function estimated error. Herein, the simulations were 
conducted under the following conditions. 

• Sampling Frequency: 11.025kHz 

• The Number of Samples in One Frame: 4096 samples 

5 • The Number of Frames in One Block: variable (2 frames, 4 
frames, 8 frames, 16 frames) 

• Update Period of Filter Characteristic: per block (about 
0.75seconds in case of the number of frames in one block 
being two, about 1.5 seconds in case of 4 frames, about 3 

10 seconds in case of 8 frames, about 6 seconds in case of 16 
frames) 

• The Mean Number of Times of Ensemble Averaging in One 
Block: 31 times in case of the number of frames in one block 
being two, 63 times in case of 4 frames, 127 times in case of 

15 8 frames, 255 times in case of 16 frames {For the purpose of 
increasing the mean number of times, as shown in Fig. 49, one 
frame is divided into 16 intervals, and data corresponding to 
one frame is extracted from the head of each divisional 
interval while overlapping data extraction successively, 

2 0 thereby to derive individual parameter values to be ensemble- 
averaged, and the derived individual parameter values are 
respectively ensemble -averaged in one block. Therefore, the 
mean number of times N of ensemble averaging becomes such 
that N=(16 x the number of frames in one block - 1) . } 

25 In any case, the filter characteristics are not set 

in the first block, and the initial setting is implemented in 
the second block, and thereafter, updating is performed per 
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block. The axis of ordinates (dB) is defined such that OdB 
represents the initial state where the filter characteristics 
are not set. Differences in simulation condition with 
respect to Figs. 33 to 48 are as follows. 
5 [Table 2] 



Fiaure Number 


Presence / Absence 
of Double Talk 


Number of Frames 
in One Block 


Fig. 33, Fig. 41 


Absent 


2 


Fig. 34, Fig. 42 


4 


Fig. 35, Fig. 43 


8 


Fig. 36, Fig. 44 


16 


Fig. 37, Fig. 45 


Present 


2 


Fig. 38, Fig. 46 


4 


Fig. 39, Fig. 47 


8 


Fig. 40, Fig. 48 


16 



[0171] 

The simulation results of Figs. 33 to 48 are 
10 considered . 

(1) In Case of Absence of Double Talk 

Since the rising speed of the echo cancellation 
amount is Fig. 33 > Fig. 34 > Fig. 3 5 > Fig. 36, the rising 
speed of the echo cancellation amount becomes faster as one 
15 block (update period) becomes shorter (the number of frames 

per block becomes smaller) . When the number of frames in one 
block is 2, 4 or 8, the echo cancellation amount of about 
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25dB is obtained (Fig. 33, 34 or 35). When the number of 
frames in one block increases, i.e. 16 frames, the echo 
cancellation amount requires a long time for reduction 
thereof (Fig. 36) . Since the estimated error convergence 
5 speed is Fig. 41 > Fig. 42 > Fig. 43 > Fig. 44, the estimated 
error convergence speed becomes faster as one block becomes 
shorter . 

(2) In Case of Presence of Double Talk 

When the number of frames in one block is 2 or 4 

10 frames, the echo cancellation amount is not increased (Fig. 

3 7 or 38) , and the estimated error is not converged (Fig. 4 5 
or 46) and thus can not be estimated. When the number of 
frames in one block is 8, the echo cancellation amount of 
about 15dB is obtained (Fig. 39) , and the estimated error is 

15 converged to about -6dB (Fig. 47) . When the number of frames 
in one block is 16, the echo cancellation amount of about 
17dB is obtained (Fig. 40) , and the estimated error is 
converged to about lOdB and thus the fairly stable estimation 
can be achieved. 

20 From the foregoing simulation results, the 

followings can be said. 

(a) When the double talk is not detected, the 
convergence of the estimated error can be quickened by 
relatively shortening the update period of the filter 

25 characteristic . 

(b) When the double talk is detected, the estimated 
error can be fully converged by relatively prolonging the 
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update period of the filter characteristic. 

Therefore, as described above, the transfer function 
calculating means 502 of Fig. 28 makes relatively longer the 
update period of the filter characteristics of the filter 
5 means 40-1 to 40-4 while the double talk is detected, whereas 
makes relatively shorter the update period of the filter 
characteristics while the double talk is not detected. This 
makes it possible to fully converge the estimated errors when 
the double talk exists, and further, quicken the convergence 
10 of the estimated errors when there is no double talk. 
[0172] 

Fig. 50 shows a modification of the stereo echo 
canceller 16, 24 of Fig. 28. The same symbols are used with 
respect to those portions common to Fig. 28. In this 

15 modification, an orthogonal i zing filter is disposed on signal 
lines of loudspeakers SP(L) and SP(R). An inverse filter 596 
has an inverse characteristic of the orthogonal i zing filter 
500 {the inverse filter characteristic V of the foregoing 
equation (96)}, thereby to restore output signals x ma x and x min 

20 of the orthogonal i zing filter 500 to the original signal x L 
and x R , and feeds them to the loudspeakers SP (L) and SP(R) . 

In the foregoing embodiments, the number of the 
loudspeakers is two and the number of the microphones is two. 
However, it may also be configured that the number of the 

25 loudspeakers is two, while the number of the microphones is 
one. Fig. 51 shows a structural example as a result of 
modifying Fig. 1 in such a manner. The same symbols are used 
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with respect to those portions common to Fig. 1. Left/right 
two-channel stereo signals x L and x R transmitted from the 
spot on the counterpart side and inputted into line input 
ends LI (L) and LI (R) are outputted from sound output ends 
5 SO(L) and SO(R) as they are (i.e. not through sum/difference 
signal producing means 52) , and reproduced at loudspeakers 
SP(L) and SP (R) , respectively. 

Filter means 40-1 is set with an impulse response 
corresponding to a transfer function H L between the 

10 loudspeaker SP(L) and a microphone MC and performs, using 
such an impulse response, a convolution calculation of a 
signal x L to be outputted from the sound output end SO(L) # 
thereby producing an echo cancel signal EC1 corresponding to 
a signal y L obtained such that the signal x L outputted from 

15 the sound output end SO(L) is reproduced at the loudspeaker 
SP(L), collected by the microphone MC and inputted into a 
sound input end SI. Filter means 40-3 is set with an impulse 
response corresponding to a transfer function H R between the 
loudspeaker SP (R) and the microphone MC and performs, using 

20 such an impulse response, a convolution calculation of a 
signal x R to be outputted from the sound output end SO(R), 
thereby producing an echo cancel signal EC3 corresponding to 
a signal y R obtained such that the signal x R outputted from 
the sound output end SO(R) is reproduced at the loudspeaker 

25 SP(R), collected by the microphone MC and inputted into the 
sound input end SI. An adder 44 performs a calculation of 
EC1+EC3 . A subtracter 48 subtracts an echo cancel signal 
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EC1+EC3 from a collected audio signal y (=y L +y R ) of the 
microphone MC inputted from the sound input end SI, thereby 
to perform echo cancellation. An echo-canceled signal e 
(=e L +e R ) is outputted from a line output end LO and 
5 transmitted toward the spot on the counterpart side. 
[0173] 

The sum/difference signal producing means 52 
performs addition, using an adder 54, of the left/right two- 
channel stereo signals x L and x R inputted into the line input 

10 ends LI (L) and LI (R) so as to produce a sum signal x M (=x L +x R ) , 
while performs subtraction thereof using a subtracter 56 so 
as to produce a difference signal x s {=x L -x R (or it may also 
be x R -x L ) } . Transfer function calculating means 58 
implements a cross- spectrum calculation between the sum 

15 signal x M and the difference signal x s produced by the 

sum/difference signal producing means 52 and the signal e 
outputted from the subtracter 48 and, based on this cross- 
spectrum calculation, sets filter characteristics (impulse 
responses) of the filter means 40-1 and 40-3. Specifically, 

20 upon starting the system, the filter characteristics of the 

filter means 40-1 and 40-3 are not set, i.e. coefficients are 
all set to zero, so that the echo cancel signals EC1 and EC3 
are zero, and thus the collected audio signal of the 
microphone MC itself is outputted from the subtracter 48. 

25 Therefore, at this time, the transfer function calculating 

means 58 performs the cross -spectrum calculation between the 
sum signal x M and the difference signal x s produced by the 
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sum/difference signal producing means 52 and the collected 
audio signal e of the microphones MC outputted from the 
subtracter 48 and, based on this cross-spectrum calculation, 
derives transfer functions of two audio transfer systems 
5 between the loudspeakers SP(L) and SP (R) and the microphone 
MC, respectively, and implements initial setting of the 
filter characteristics of the filter means 40-1 and 40-3 to 
values corresponding to such transfer functions . After the 
initial setting, since the echo cancel signals are produced 

10 by the filter means 40-1 and 40-3, the echo cancel error 
signal e corresponding to a difference signal between the 
collected audio signal of the microphone MC and the echo 
cancel signal EC1+EC3 is outputted from the subtracter 48. 
Therefore, at this time, the transfer function calculating 

15 means 58 performs the cross -spectrum calculation between the 
sum signal x M and the difference signal x s produced by the 
sum/difference signal producing means 52 and the echo cancel 
error signal e outputted from the subtracter 4 8 and, based on 
this cross-spectrum calculation, derives estimated errors of 

2 0 the transfer functions of the two audio transfer systems 

between the loudspeakers SP(L) and SP(R) and the microphone 
MC, respectively, and updates the filter characteristics of 
the filter means 40-1 and 40-3 to values that cancel the 
estimated errors, respectively. By repeating this updating 

25 operation per prescribed time period, the echo cancel error 
can be converged to a minimum value. Further, even if the 
transfer functions change due to movement of the microphone 
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positions or the like, the echo cancel error can be converged 
to a minimum value by sequentially updating the filter 
characteristics of the filter means 40-1 and 40-3 depending 
thereon . 
5 [0174] 

Correlation detecting means 60 detects a correlation 
between the sum signal x M and the difference signal x s based 
on a correlation value calculation or the like, and stops 
updating of the foregoing filter characteristics when the 

10 correlation value is no less than a prescribed value. When 

the correlation value becomes lower than the prescribed value, 
updating of the foregoing filter characteristics is restarted. 
Also in the embodiments other than Fig. 1, it can be 
configured that the number of the loudspeakers is two, while 

15 the number of the microphones is one. 
[0175] 

The stereo echo canceller 16, 24 shown in each of 
the foregoing embodiments can be formed by the dedicated 
hardware or can also be realized through software processing 

2 0 in a general computer. For example, the functions of the 
respective blocks as shown in Fig. 1 and so forth can be 
accomplished by a CPU (Central Processing Unit) and storing 
means such as a RAM or ROM constituting the computer. Namely, 
the CPU may be caused to function as the echo canceller 

2 5 according to a program stored in the storing means such as 
the ROM or RAM. 

[0176] 
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In the foregoing embodiments, the description has 
been made about the case where the two- channel stereo signals 
are handled. However, the echo cancellation can also be 
implemented using the technique of this invention with 
5 respect to those signals of three channels or more having a 
correlation with each other. 



